PREFACE


This progress report appears in two parts. Part A is a summary of work done in support
of this program at the Naval Undersea Center. Part B contains final reports submitted by
contractors in support of this program.

The major milestone achieved during this period is the demonstration of the CCD
implementation of the TV bandwidth reduction system. Still pictures and video tapes have
been produced at various bit rates. System specifications are documented, and contract
negotiations are in progress to have a flight system produced. This system is described
in Appendix A.

Also accomplished are simulation studies on more advanced bandwidth reduction systems
that take advantage of both spatial and temporal correlation in TV images. Results
of this work are described in Appendices B and C.

The algorithm for the prime cosine transform is documented in Appendix D, and a surface
wave prime Fourier transform device implemented to evaluate the algorithm is described
in Appendix E.

Hardware developed under this program is shown to have applications in many areas of
signal processing, and techniques for modularly expanding the basic building blocks of
this hardware into larger systems are investigated in Appendix F. Several signal
processing architectures are described in Appendix G.

Simulation  studies  by  USC  and  others  document  that  the  Cosine  transform  is  very  close 
to  the  Karhunen-Loeve  transform  when  applied  to  image  data.  These  results  are  obtained 
from  experimental  data  but  as  yet  no  analytical  explanation  is  available.  However,  NUC 
has  undertaken  to  give  a  theoretical  basis  for  the  performance  of  the  cosine  transform  and 
the  results  are  presented  in  Appendix  H. 

Papers have been presented at several conferences on the BBD and CCD bandwidth
reduction systems and are included as Appendices I and J.

Work  is  also  conducted  on  a  frequency  hopping  modem  and  some  preliminary  results 
are  presented  in  Appendices  K  and  L. 

For  ease  of  content  location,  the  enclosed  appendices  are  numbered  consecutively  in 
Arabic  numerals  beginning  with  the  title  page  of  Appendix  A  and  continuing  to  the  end 
of  Appendix  L. 
HYBRID COSINE/DPCM TV BANDWIDTH REDUCTION HARDWARE


by Robert Means and E. H. Wrench, Jr.


Following is a technical description of the bandwidth compression system developed at
the Naval Undersea Center. This paper is an excerpt from the specifications for a contract
currently under negotiation to produce flight hardware for the Army AQUILA RPV.
Section I. Bandwidth Reduction System Specifications


1.1 The item required is a system which can reduce the data required to transmit
television images over a digital communication modem to the rates specified
below. The system performance, in terms of picture quality and data rate, must
approach that observed in computer simulations. The system must be capable of
accepting EIA monochrome television signals.

1.2  Considerable  simulation,  analysis  and  breadboard  experimentation  have  been 
done  on  the  problem  of  optimizing  the  bandwidth  compression  that  can  be  obtained 
by  a  real  time  system  within  a  small  power,  weight,  size  and  cost  constraint. 

The system will be used in the Aquila mini remotely piloted vehicle. Figure
1.1 is a block diagram of the hardware required. It must accept composite video
in the flight hardware (transmitter) and return composite video at the ground
station (receiver). The data rates out of the source encoder are specified
below. Schematics of the subsystems that have been implemented and tested at the
Naval Undersea Center are contained at the end of this paper. These are provided
for informational purposes. They should be considered design guidelines which
would aid the contractor in meeting system specifications.


Figure 1.1. Block diagram of the required hardware (source encoder, source decoder, display).


1.3 Each specification will be expanded upon in a separate section where
appropriate. Some of these requirements are rigid and must be adhered to by
the contractor. Other requirements are such that considerable design effort,
invention and ingenuity must be expended by the contractor to meet the performance
specifications. It is realized by the government that the required system
imposes performance specifications that will require novel techniques to
achieve and that there are tradeoffs between power, weight, cost and performance.
Specifications will be found in each section. A partial list follows:

1.4  Timing 

1)  The  number  of  picture  elements  in  each  horizontal  line  must  not  be 
appreciably  less  than  256. 

2)  The  number  of  lines  per  field  must  be  262.5. 

3) The frame rate must be 7.5 frames per second with the capability of
operating at a reduced frame rate with a minimum of system redesign. Some design
effort should be expended to ensure that the system would not be completely
incompatible with a solid state sensor operating in the snapshot mode.

4)  The  video  signal  must  be  transformed  in  horizontal  blocks  of  either 
31  or  32  pixels. 

5)  Video  sample  rate  =  4.8  MHz. 

6)  System  clock  frequency  =  9.6  MHz. 

7)  There  must  be  a  sync  output  pulse  every  8  fields. 

1.5  Discrete  Cosine  Transform 

1)  The  horizontal  transform  must  be  a  discrete  cosine  transform. 

2) Dynamic range of output coefficients must be $\geq 2^8$.

3) Peak-to-peak signal to peak-to-peak noise of output coefficients must
be $> 2^8$.

4)  Transform  coefficients  must  be  accurate  to  within  one  percent  over 
the  whole  dynamic  range  of  the  transform. 

1.6  Differential  Pulse  Code  Modulator 

1)  The  vertical  transform  must  be  a  differential  pulse  code  modulator 
operating  on  cosine  transform  coefficients. 

2)  Quantizer  must  be  within  the  DPCM  feedback  loop. 

3) Dynamic range at input must be $2^8$.

4) Maximum error at output with steady state input signal must not
exceed one least significant bit when quantized at 6 bits per coefficient.

5)  Quantization  levels  must  be  spaced  for  equal  probability  of  occupancy 
at  all  bit  rates. 

6)  DPCM  must  be  capable  of  variable  bit  assignment  per  coefficient  from 
zero  to  six  bits. 

7)  Output  rate  must  be  continuous  serial  capable  of  switching  between 
200,  400,  800  and  1600  kilobits  per  second  on  command  from  the  modem. 

1.7  Ground  Station 

1)  The  ground  station  must  be  implemented  with  digital  hardware. 

2)  Power  and  weight  are  not  critical  factors  in  the  ground  station. 

1.8  Form  Factors 

1) The system must fit on one (1) card, the size of which is approximately
4" x 8" x 1/2". Exact form factors will be specified after award of
contract.

2)  The  system  must  have  a  weight  of  less  than  one  pound. 

3) The system must dissipate less than ten watts.

4) Environmental specifications are those specified for the Aquila
Lockheed RPV.


1.9  Administrative 

1) There will be design review meetings held every 90 days at the
contractor's plant.

2) Production cost estimates of the system are requested with a production
run of 1000 a year for 5 years to be used as a basis.

3) A bid is requested on a package consisting of two airborne subsystems
and one ground station.

4)  A  bid  is  requested  on  a  package  consisting  of  3  airborne  subsystems 
and  two  ground  stations. 


5)  Proposal  evaluation  criteria  are  listed  in  order  of  importance. 

a) weight, size, power

b)  performance  and  design 

c)  production  cost 

d)  contractor  experience  and  personnel 

e)  risks  inherent  in  design 

f)  bid  prices 

6)  Delivery  date  is  9-10  months  after  award  of  contract. 
Section II. Timing and Synchronization


2.1 The TV bandwidth reduction system must operate synchronously with the TV
camera and the Harris modem. The clocks required to ensure compatibility have
been determined and will be supplied by Harris. These clocks will be available
both in the plane and on the ground with synchronized time and phase
relationships.

2.2  The  TV  sync  in  the  plane  will  be  supplied  by  the  camera.  On  the  ground 
the  TV  sync  must  be  reconstructed  by  the  bandwidth  reduction  system  from  a 
timing  pulse  transmitted  from  the  plane  and  the  system  clocks.  The  detailed 
operation  of  the  system  timing  is  described  below. 

2.3 In the plane, the TV sync is generated by a TV sync generator chip that is
driven by a 2.0 MHz clock. The 2.0 MHz clock is supplied by Harris and is
derived from the system clock (2 MHz = 4.8 MHz ÷ 12 × 5). Each TV line is
composed of 130 of the 2 MHz clocks for a line time of 65.0 µs. The V and H
signals generated by the sync chip are supplied to the bandwidth reduction
system.

2.4 The basic sample clock rate for the bandwidth reduction system is 4.8 MHz
and is derived from a 9.6 MHz clock supplied by Harris. The TV line time
corresponds to 312 clocks at 4.8 MHz. Of these, 256 correspond to the active TV
line, and 56 to retrace time. The even field is processed as though it were
identical to the odd field, so the vertical resolution is 262.5 lines.

2.5 The phase relation of the horizontal sync pulse to the 4.8 MHz clock is
not  known  but  will  remain  constant  since  the  horizontal  sync  is  derived  from 
the  2  MHz  clock  which  in  turn  is  derived  from  the  4.8  MHz  clock.  Therefore 
each  TV  line  will  start  at  the  same  time  relative  to  the  4.8  MHz  sample  clock. 
This  permits  the  4.8  MHz  clock  to  be  used  to  partition  the  active  TV  lines 
into  256  pixel  blocks.  The  pixels  are  numbered  with  0  on  the  left  and  255  on 
the  right. 

2.6 The effective frame rate of the TV is reduced by a factor of eight by only
coding 1/8 of a TV line during each horizontal time interval (65 µs). Eight
TV fields must be swept out by the camera before one entire field has been
coded. During the first field, pixels 0-31 of all lines will be coded and
transmitted. During the second field pixels 32-63 will be coded, etc.

2.7 The blocks of 32 pixels form vertical bars in the TV field and are referred
to as stripes. These stripes are numbered 0-7 with stripe zero containing pixels
0-31. A stripe counter is incremented by the vertical sync pulse obtained
from the camera.
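
The stripe schedule of paragraphs 2.6 and 2.7 can be sketched as follows. This is a minimal illustration only; the function names are hypothetical, and the field index is assumed to count fields from the frame sync pulse.

```python
PIXELS_PER_LINE = 256
STRIPE_WIDTH = 32
STRIPES = PIXELS_PER_LINE // STRIPE_WIDTH   # 8 stripes, numbered 0-7


def stripe_for_field(field_index):
    """Stripe coded during a given field (assumed counted from frame sync)."""
    return field_index % STRIPES


def pixel_range(stripe):
    """First and last pixel (inclusive) of a stripe."""
    first = stripe * STRIPE_WIDTH
    return first, first + STRIPE_WIDTH - 1


if __name__ == "__main__":
    for field in range(STRIPES):
        print(field, pixel_range(stripe_for_field(field)))
```

Eight consecutive fields cover pixels 0-255 exactly once, which is why the effective frame rate is one eighth of the field rate.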
2.8 A pulse that exactly describes the beginning of stripe zero (frame sync)
is  used  to  obtain  video  synchronization  between  the  plane  and  the  ground.  The 
frame  sync  pulse  (7.5  Hz  rate)  is  sent  to  the  status  modem  in  the  plane  and  then 
transmitted  to  the  ground.  At  the  ground  the  pulse  is  obtained  from  the  status 
modem  with  its  time  relative  to  the  data  preserved.  The  frame  sync  is  then 
used  to  synchronize  the  pixel  counter,  line  counter,  and  stripe  counter  on  the 
ground  with  those  in  the  plane.  It  is  also  used  to  lock  a  TV  sync  chip  (required 
to  produce  composite  video)  on  the  ground  with  the  one  in  the  camera.  Therefore, 
complete  synchronization  between  air  and  ground  video  is  maintained  by  the  frame 
sync  signal. 

2.9 Another area where synchronization is required is the interface between
the TV bandwidth compression system and the Harris modem. The interface requires
that serial data be passed from the TV system to Harris at four fixed rates: 200,
400, 800, and 1600 kilobits/sec. This corresponds to, respectively, 13, 26, 52, and
104 bits/TV line. The output bits are to be synchronous with a data clock supplied
by Harris at the bit rate. The phase of this clock relative to the 4.8 MHz
clock is not known but will remain fixed at each of the four frequencies.
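
The timing figures quoted in this section follow from the 65 µs line time. The short check below reproduces them with exact rational arithmetic; it is only a worked example, and the constant and function names are illustrative.

```python
from fractions import Fraction

LINE_TIME_US = 65                     # horizontal line time, paragraph 2.3
SAMPLE_CLOCK_MHZ = Fraction(48, 10)   # 4.8 MHz video sample clock
SYNC_CLOCK_MHZ = Fraction(2)          # 2.0 MHz clock driving the sync chip


def clocks_per_line(clock_mhz):
    """Number of clock periods in one TV line (MHz times microseconds)."""
    return clock_mhz * LINE_TIME_US


def bits_per_line(rate_kbps):
    """Bits passed to the modem during one TV line at the given serial rate."""
    return Fraction(rate_kbps, 1000) * LINE_TIME_US


if __name__ == "__main__":
    print(clocks_per_line(SYNC_CLOCK_MHZ))      # 2 MHz clocks per line
    print(clocks_per_line(SAMPLE_CLOCK_MHZ))    # 4.8 MHz clocks per line
    print([int(bits_per_line(r)) for r in (200, 400, 800, 1600)])
```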

2.10  The  interface  at  the  ground  between  the  modem  and  the  TV  system  is 
identical  to  the  one  just  described.  The  data  from  the  modem  is  presented 
serially  and  synchronously  with  the  data  clock  from  the  modem.  The  data  rate 
at the interface is determined by the state of 2 bits (x, y) supplied to the
system by the command and control modem.
Section III. Discrete Cosine Transform


3.1 There are two discrete cosine transforms which can be defined for a finite
block size data set. They have been called the odd discrete cosine transform
(ODCT) and the even discrete cosine transform (EDCT). For a complete discussion
of the differences see Appendix A.

3.2 There exist many possible algorithms to implement the discrete cosine
transform. It is not the purpose of this procurement to specify a given
algorithm. However, the hardware implementations of each approach considered
by the Naval Undersea Center will be discussed.


Chirp  Z  Algorithm 

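3.3 Consider the odd cosine transform. The chirp Z algorithm computes the
coefficients of a 32 point data block by implementing the equation

$$G_k = 2\,\mathrm{Re}\left\{ e^{-i\pi k^2/63} \sum_{n=0}^{31} e^{-i\pi n^2/63}\; e^{+i\pi (k-n)^2/63}\; \tilde{g}_n \right\}$$

where

$$\tilde{g}_n = \begin{cases} \tfrac{1}{2}\, g_0, & n = 0 \\ g_n, & n = 1 \ldots 31 \end{cases}$$
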


A block diagram of the hardware is shown in figure 3.1. This algorithm was
implemented by the Naval Undersea Center with charge coupled device transversal
filters developed by Texas Instruments. It and the rest of the bandwidth
compression system have been demonstrated with real time video. The complete
schematics for the system are attached at the end of this paper.
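
The chirp Z form rests on the identity 2nk = n^2 + k^2 - (k-n)^2, which turns the transform kernel into a premultiply, a convolution with a chirp (the transversal filter), and a postmultiply. The sketch below checks this numerically against direct summation. It is an illustration only: the scale convention (factor of 2, first sample halved) is an assumption, and the function names are hypothetical.

```python
import cmath
import math

N = 32            # block length
M = 2 * N - 1     # 63, period of the odd cosine transform


def odct_direct(g):
    """Odd DCT by direct summation: g[0] + 2 * sum g[n] cos(2 pi n k / M)."""
    return [g[0] + 2 * sum(g[n] * math.cos(2 * math.pi * n * k / M)
                           for n in range(1, N))
            for k in range(N)]


def odct_chirp_z(g):
    """Same transform as premultiply, chirp convolution, postmultiply."""
    gt = [g[0] / 2] + list(g[1:])            # assumed: first sample halved
    pre = [gt[n] * cmath.exp(-1j * math.pi * n * n / M) for n in range(N)]
    out = []
    for k in range(N):
        # Convolution with the chirp exp(+i pi m^2 / M): the transversal filter.
        conv = sum(pre[n] * cmath.exp(1j * math.pi * (k - n) ** 2 / M)
                   for n in range(N))
        post = cmath.exp(-1j * math.pi * k * k / M) * conv
        out.append(2 * post.real)
    return out


if __name__ == "__main__":
    samples = [0.5 + math.sin(0.3 * n) for n in range(N)]
    a, b = odct_direct(samples), odct_chirp_z(samples)
    print(max(abs(x - y) for x, y in zip(a, b)))   # difference at rounding level
```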
Figure 3.1. Block Diagram of Chirp Z Algorithm Hardware.
3.4 The hardware consists of three major blocks: the premultipliers, the
transversal filters, and the postmultipliers. All three blocks performed
marginally at the 4.8 MHz video sample rate. The system performance was close
to, but did not equal, that of the computer simulations.

3.5 The breadboard system built by NUC used an LSI chip developed by Texas
Instruments. All the premultipliers and differential current integrators were
placed on the device. The transversal filters themselves performed adequately
at the sample rate of interest (4.8 MHz), but the peripheral devices (multipliers
and differential current integrators) did not. In the breadboarded
system these were implemented by separate modules or discrete components. It
was the off chip modules which determined the size of the laboratory
breadboarded system. In order to meet the contract specifications it is necessary
to put as many of these peripheral devices on the chip as possible; i.e., a
new CCD chip is required.


3.6 Since the processor has a complete line time to process 32 samples, it is
possible to capture the video in an analog delay line or memory at the high sample
rate (4.8 MHz) and read it out into the processor at a slow rate (1.2 MHz).
This option would relax the speed requirements on the multipliers, differential
current amplifiers, and CCDs. It would also relax the requirements on the
DPCM. It could, however, add a separate piece of hardware to the system.


3.7 The premultiplier should have an accuracy of at least 6 bits. It is presumed
that the video is limited to 6 bit accuracy, probably in the display.
The transform coefficients should have a dynamic range of at least eight bits.


Prime  transform  algorithm 

3.8 The prime transform algorithm can be implemented by the hardware shown
in figure 3.2. The prime transform requires the use of a permuting analog memory,
a transversal filter and a second permuting memory at the output. In addition,
it requires a separate computation of the D.C. coefficient. Also, the first
video sample must be treated differently from all the rest. An analog memory
chip has been developed by the Reticon Corporation under contract to the
Naval Undersea Center. This system has not been breadboarded by the Naval
Undersea Center and the memory chip has not been sufficiently tested. However,
the performance specifications claimed by Reticon are attached in the appendices.
The prime algorithm for a Fourier transform is
and R is a primitive root of N.
Figure 3.2. Prime Transform Algorithm.


Direct 
3.9 The calculation of the discrete cosine transform could be done by
straightforward digital techniques. Two methods of performing this digital cosine
transform have been investigated. The first method is a direct implementation
of the algorithm while the second involves a trigonometric substitution to
reduce the complexity of the calculation.


3.10 The odd discrete cosine transform of the form


has been directly implemented for N=32. The design uses a read-only memory to
store the basis vectors and performs the multiplication using LSI multipliers.
(See Figure 3.3.) Due to the limitation of a clock frequency of 9.6 MHz and a
maximum of 65 µsec in which to perform the calculation, only the first 19
coefficients can be calculated. In order to obtain all 32 coefficients, it is
necessary to have at least 2 parallel processors operating together.
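
The limit of 19 coefficients is consistent with a simple clock budget. The arithmetic below is a sketch under one stated assumption that the text does not confirm: each coefficient costs one multiply-accumulate per input sample, one per clock, with no overhead.

```python
CLOCK_HZ = 9.6e6          # system clock
LINE_TIME_S = 65e-6       # time available per block of 32 samples
BLOCK = 32                # samples per block

# Assumed cost model: 32 multiply-accumulates per coefficient, one per clock.
clocks_available = round(CLOCK_HZ * LINE_TIME_S)
coefficients_per_line = clocks_available // BLOCK
processors_needed = -(-BLOCK // coefficients_per_line)   # ceiling division

if __name__ == "__main__":
    print(clocks_available, coefficients_per_line, processors_needed)
```

Under this model a single processor has 624 clocks per line, enough for 19 complete coefficients, so two processors are needed for all 32.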

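3.11 The transform can be implemented without having to do any multiplies by
the substitution of variable $F_n = \cos^{-1} g_n$. The transform then becomes

$$G_k = \cos F_0 + \sum_{n=1}^{N-1} \left[ \cos\!\left(F_n + \frac{2\pi nk}{2N-1}\right) + \cos\!\left(F_n - \frac{2\pi nk}{2N-1}\right) \right], \qquad k = 0, 1, \ldots, N-1$$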
By using this method, the original transform requires only additions. (See
Figure 3.4.) The values of the cosine, arccosine, and $2\pi nk/(2N-1)$ are contained
in read-only memories. This arccosine method has been implemented for N=32
and requires about the same number of parts as the direct method. Since it is
limited by the same constraints of clock frequency and calculation time, it
also can calculate only the first 19 of 32 coefficients. However, a unit has
been built using two processors in parallel that can calculate all 32
coefficients by having one processor calculate $G_0$ through $G_{15}$ while the second
processor calculates $G_{16}$ through $G_{31}$ at the same time. An overflow limiting
feature has been included in this design. This feature makes it possible to experiment
with the selection of bits to be used for the output without suffering large
errors from an overflow condition. This dual processor unit requires about 60%
more parts than the single unit. Both the direct implementation and the
arccosine method units use double buffering at both the input and output so
that the processor can be operating on data while other data is read in and
results read out. This increases the time available for calculation.
Schematics for these systems are attached at the end of this paper.
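
The arccosine method relies on the identity cos(A+B) + cos(A-B) = 2 cos A cos B: with g_n = cos F_n, each product g_n cos(theta) becomes a sum of two table lookups. The sketch below compares it with the multiply form. It is illustrative only; it assumes samples scaled to the range -1 to 1 (so the arccosine exists) and the same scale convention as the equation in paragraph 3.11, and the function names are hypothetical. In hardware the cosine and arccosine calls would be ROM lookups.

```python
import math

N = 32
M = 2 * N - 1


def odct_multiply(g):
    """Reference form using one multiply per term."""
    return [g[0] + 2 * sum(g[n] * math.cos(2 * math.pi * n * k / M)
                           for n in range(1, N))
            for k in range(N)]


def odct_arccos(g):
    """Addition-only form; requires |g[n]| <= 1 so that acos is defined."""
    F = [math.acos(x) for x in g]             # arccosine table lookup
    out = []
    for k in range(N):
        acc = math.cos(F[0])
        for n in range(1, N):
            theta = 2 * math.pi * n * k / M   # angle table lookup
            acc += math.cos(F[n] + theta) + math.cos(F[n] - theta)
        out.append(acc)
    return out


if __name__ == "__main__":
    samples = [math.cos(0.2 * n) for n in range(N)]
    a, b = odct_multiply(samples), odct_arccos(samples)
    print(max(abs(x - y) for x, y in zip(a, b)))
```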
Section IV. Prewhitening Filter


4.1  There  are  two  reasons  to  use  a  prewhitening  filter  as  the  first  operation 
on  the  video  signal. 

4.2 One: Since the cosine transform compacts the sampled video into the low
order transform coefficients, and the subsystem which calculates the cosine
transform has a limited dynamic range, the high frequency coefficients will
have a poor signal to noise ratio. One method of increasing the signal to
noise ratio for the high frequency coefficients is to filter the video to
emphasize the high frequency information. The phase requirements and invertibility
of this filter and their implications will be discussed later.

4.3 Two: The DPCM requires a non-linear quantizer. The choice of the
quantization levels is extremely important to the resulting picture quality. The
levels are chosen by examining the probability distribution for the difference
between transform coefficients. Since the variance of the transform
coefficients decreases as a function of n (coefficient number), each coefficient requires
a separate quantizer. This requirement can be eliminated by normalizing all
the coefficients to have the same variance. The normalization of the
coefficients can be done by multiplying each coefficient by the appropriate scale factor
as it enters the DPCM subsystem. However, the high frequency coefficients
would still have a poor signal to noise ratio. Instead, since this operation
on the transform coefficients is equivalent to a frequency domain multiply, one
can implement the scaling necessary for the DPCM by a prewhitening filter
directly on the video.

Design  considerations  of  the  prewhitening  filter 

4.4 Let us assume that the variance of the cosine transform coefficients
decreases in proportion to 2/(n+2). (This is a fair approximation for most TV
scenes.) The "ideal" prewhitening filter would multiply the coefficients by
(n+2)/2. This filter is very difficult to implement in the time domain.
However, one can implement a simple filter which approximates the "ideal" function.
The difference between the approximation and the "ideal" equalization can be
removed by the use of a multiplier before the DPCM. This multiplier then
operates on coefficients with good signal to noise ratio and only has to multiply
coefficients by a relatively small factor. A simple example of a prefilter
which can be implemented by a delay line and a differential amplifier is shown
in figure 4.1.
Figure 4.1. Simple Example of a Prefilter.


The impulse response of this filter is also shown. In the frequency domain
this filter multiplies the Fourier transform coefficients by the transfer
function

$$H(f) = 1 - \alpha \cdot 2i \sin \pi f \Delta\tau$$


4.5 There are severe limitations on the filter phase distortion allowed. The
filter in the example has complex phase shift. It is not possible to compute
a cosine transform of the video and use this prefilter. If $\alpha = 1.0$, then it is
possible to compensate for the purely imaginary phase shift, but it introduces
problems with dc signals. The filter should multiply the variance of the cosine
transform coefficients by the transfer function of the filter. Since the cosine
transform of the video is the Fourier transform of the extended data set
(Appendix A) one can define:

$g(t)$ = sampled video data set

$g_e(t)$ = extended data set = $g(t) + g(-t)$

$h(t)$ = impulse response of the prefilter

$F(g(t))$ = Fourier transform of $g(t)$ = $G(f)$

$C(g(t))$ = cosine transform of $g(t)$ = $F(g(t) + g(-t))$ = $G(f) + G^*(f)$


Thus

$$\begin{aligned}
C(h(t) * g(t)) &= F\big(h(t) * g(t) + h(-t) * g(-t)\big) \qquad (1) \\
&= H(f)\,G(f) + H^*(f)\,G^*(f) \\
&= 2\,\mathrm{Re}\,H(f)\,G(f) = 2\,\mathrm{Re}\,F(h * g)
\end{aligned}$$

If $H(f)$ is purely real, then the cosine transform of $h * g$ can be expressed as

$$C(h * g) = H(f)\,[G + G^*] \qquad (2)$$


Thus  we  have  multiplied  the  cosine  transform  coefficients  by  the  transfer 
function  of  the  prefilter.  It  appears  that  a  prefilter  with  a  real  transfer 
function  is  the  simplest  one  to  achieve  the  proper  whitening.  Such  a  filter 
is slightly more complicated to build. The impulse response of such a filter
is given in figure 4.2.
Figure 4.2. Filter Impulse Response.


In  the  frequency  domain  this  filter  has  the  transfer  function 


$$H(f) = 1 - 2\alpha \cos 2\pi f \Delta\tau$$
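
A real transfer function of this form arises from an impulse response that is symmetric about its center tap. The sketch below evaluates the transfer function of three taps (-α, 1, -α) spaced Δτ apart and confirms that it is real and equal to 1 - 2α cos 2πfΔτ. The tap layout is an assumption inferred from the transfer function, since figure 4.2 itself is not reproduced here; the function names are hypothetical.

```python
import cmath
import math


def transfer(taps, f, dt=1.0):
    """H(f) = sum of h_m * exp(-i 2 pi f m dt) for taps given as {m: h_m}."""
    return sum(h * cmath.exp(-2j * math.pi * f * m * dt)
               for m, h in taps.items())


def symmetric_prefilter(alpha):
    """Assumed tap layout: center tap 1, neighbors -alpha at +/- one delay."""
    return {-1: -alpha, 0: 1.0, 1: -alpha}


if __name__ == "__main__":
    for f in (0.0, 0.1, 0.25, 0.5):
        print(f, transfer(symmetric_prefilter(0.4), f))
```

At f = 0 the gain is 1 - 2α, so for α < 0.5 a dc bias on the video passes through the filter as dc.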

4.6 It has been assumed that the inverse filter will be implemented digitally
in the ground station. The inverse of the filter shown in figure 4.2 cannot
be implemented easily; however, it is required only at the ground station. The
impulse response of the inverse filter is shown in figure 4.3.

Figure 4.3


where

$$\beta = \frac{1}{2\alpha}\left(1 - \sqrt{1 - 4\alpha^2}\right) \quad \text{for } \alpha < 0.5$$


This inverse can be approximately implemented by the hardware shown in figure 4.4.


4.7 The accuracy of this filter will depend on the number of stages in the
nonrecursive part and the value of $\beta$. For example, for a pre-emphasis filter
with $\alpha = 0.4$, $\beta = 0.5$, and with eight stages in the non-recursive filter the
accuracy of the output will be better than one part in $2^8$. Output is valid
after n+1 stages.
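
The numbers in this example can be checked directly. The sketch below computes β from α, builds a two-sided inverse impulse response proportional to β^|n| truncated to eight stages on each side, and convolves it with the prefilter taps. It illustrates the factorization only and is not a model of the recursive and nonrecursive hardware of figure 4.4; the tap layout (-α, 1, -α) and the function names are assumptions.

```python
import math


def beta_from_alpha(alpha):
    """beta = (1 - sqrt(1 - 4 alpha^2)) / (2 alpha), for 0 < alpha < 0.5."""
    return (1.0 - math.sqrt(1.0 - 4.0 * alpha * alpha)) / (2.0 * alpha)


def inverse_taps(alpha, stages):
    """Two-sided inverse impulse response truncated to |n| <= stages."""
    beta = beta_from_alpha(alpha)
    gain = beta / (alpha * (1.0 - beta * beta))
    return {n: gain * beta ** abs(n) for n in range(-stages, stages + 1)}


def cascade(alpha, stages):
    """Convolve the prefilter taps (-alpha, 1, -alpha) with the inverse."""
    pre = {-1: -alpha, 0: 1.0, 1: -alpha}
    out = {}
    for n, a in inverse_taps(alpha, stages).items():
        for m, b in pre.items():
            out[n + m] = out.get(n + m, 0.0) + a * b
    return out


if __name__ == "__main__":
    print(beta_from_alpha(0.4))
    result = cascade(0.4, 8)
    print(result[0], max(abs(v) for n, v in result.items() if n != 0))
```

With α = 0.4 the cascade is a unit impulse except for residual terms at the truncation edges, which stay below one part in 2^8.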

4.8  It  appears  to  be  preferable  to  implement  the  prefilter  which  has  a  real 
transfer  function  for  several  reasons:  the  cosine  transform  does  not  have  to 
be modified; noise does not accumulate in the inverse filter; and, most important,
a dc bias on the video signal reaches the cosine transform as dc.
4.9 The discussion of the prefilter in this section has gone into detail on
a  number  of  alternative  designs.  Attention  was  paid  to  the  impact  on  the 
algorithms.  It  was  not  meant  to  give  the  impression  that  the  prefilter  design 
is  extremely  critical  to  picture  quality.  Several  different  prefilters  have 
been  implemented  by  NUC  and  the  differences  in  picture  quality  were  observed 
to  be  small.  It  was  important,  however,  to  implement  some  type  of  pre-emphasis 
and  de-emphasis.  The  pre-emphasis  should  process  the  whole  line  of  video  so 
that  the  de-emphasis  filter  operates  across  the  boundaries  of  the  horizontal 
interlace  used  in  the  system. 


Section V. Differential Pulse Code Modulator


5.1 The differential pulse code modulator (DPCM) is the system element that
produces the actual bandwidth reduction. The processes of prewhitening and
cosine transforming are exact and invertible and therefore produce no bandwidth
reduction. The DPCM, however, is not exactly invertible because approximations
are made in the quantization of the differences between coefficients. The use
of the approximate difference results in a reduction in the amount of data
required to transmit the coefficients. The transmitted data rate is varied
by changing the number of quantization levels used for each coefficient.

5.2  The  basic  structure  of  the  DPCM  is  as  shown  below. 


The operation is as follows. The DCT coefficients ($G_n$'s) enter in time sequence,
and are differenced with a fraction (a) of the corresponding coefficient from
the previous line ($G'_n$'s). The difference $D_n$ is then approximated by one of $2^k$
possible values by the quantizer. The value of k for each coefficient is
programmed in a ROM. The k bits from the quantizer that define the quantization
level are then transmitted serially to the modem. The quantized difference,
$\hat{D}_n$, is then added to $a \cdot G'_n$ to produce an approximation, $\hat{G}_n$, of the input
coefficient. This value is stored in the delay memory to be used as $G'_n$ in
the next line.

5.3 It is important to note that by placing the quantizer inside the feedback
loop the quantization error is kept from accumulating, and the error converges
to the smallest quantization level in steady state.
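
The effect of placing the quantizer inside the loop can be seen in a small simulation. The sketch below is illustrative only: it uses a uniform quantizer as a stand-in for the equal-probability quantizer specified in this section, and the prediction fraction a = 0.9 and step size are arbitrary assumptions. Because the encoder predicts from its own reconstructed values, the decoder forms exactly the same values and the error never exceeds half a quantizer step.

```python
def quantize(d, step=4.0, levels=64):
    """Placeholder uniform quantizer: returns (code, reconstructed difference)."""
    half = levels // 2
    code = max(-half, min(half - 1, round(d / step)))
    return code, code * step


def dpcm_encode(lines, a=0.9):
    """Encode successive lines; prediction is a * previous reconstruction."""
    prev = [0.0] * len(lines[0])
    codes = []
    for line in lines:
        row = []
        for n, g in enumerate(line):
            code, dq = quantize(g - a * prev[n])
            prev[n] = a * prev[n] + dq      # the value the decoder will form
            row.append(code)
        codes.append(row)
    return codes


def dpcm_decode(codes, a=0.9, step=4.0):
    """Rebuild coefficients from the transmitted codes."""
    prev = [0.0] * len(codes[0])
    out = []
    for row in codes:
        prev = [a * p + c * step for p, c in zip(prev, row)]
        out.append(list(prev))
    return out


if __name__ == "__main__":
    steady = [[100.0, -37.5, 12.0, 0.0] for _ in range(40)]
    decoded = dpcm_decode(dpcm_encode(steady))
    print(decoded[-1])
```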

5.4 The most critical portion of the DPCM is the design of the quantizer. For
the coding of the coefficient differences to be efficient, all quantization
levels must be equally likely. It has been shown by Habibi (Appendix J references)
and others that for a wide variety of pictures, the amplitudes of the
coefficient differences are approximately exponentially distributed. The probability
distribution for the differences is therefore assumed to be


$$f(D_n) = \frac{\alpha_n}{2}\, e^{-\alpha_n |D_n|}$$

where $\alpha_n$ is determined by the variance of the nth coefficient. Since the
coefficients have been prewhitened, all $\alpha_n$'s are approximately equal. The
distribution is symmetric about zero, so only the positive quantization levels need
be calculated. The probabilities of a difference falling into any quantization
level must be equal, i.e.

$$\int_{b_{i-1}}^{b_i} f(x)\,dx = \frac{1}{2^k}$$

where $2^k$ is the number of quantization levels and $b_i$ is the boundary between
the (i-1)th and ith quantization level. For the positive quantization levels

$$\int_0^{b_i} f(x)\,dx = \frac{i}{2^k} \quad\text{or}\quad b_i = \frac{1}{\alpha} \ln \frac{2^{k-1}}{2^{k-1} - i} \qquad (A)$$

There are also boundaries at zero and at negative $b_i$.
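
The boundary formula can be evaluated numerically. The sketch below assumes k = 6 and takes α from the rule in paragraph 5.6 (α = 2^-8 ln 2^6, about 0.0162); the function names are hypothetical. Rounded to integers, the results agree with entries of Table 1 such as 2, 43, 85, 128 and 213.

```python
import math

P_MAX = 6
ALPHA = math.log(2 ** P_MAX) / 2 ** 8     # about 0.0162, from paragraph 5.6


def boundary(i, k, alpha=ALPHA):
    """Boundary b_i for a 2^k level quantizer (i = 0 .. 2^(k-1) - 1)."""
    half = 2 ** (k - 1)
    return math.log(half / (half - i)) / alpha


def positive_boundaries(k, alpha=ALPHA):
    """Lower bounds of all positive levels; the top level extends to infinity."""
    return [boundary(i, k, alpha) for i in range(2 ** (k - 1))]


if __name__ == "__main__":
    print([round(b) for b in positive_boundaries(6)])
```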

5.5 Once the quantizer boundaries are determined, the quantization values, $Q_i$, are
chosen to minimize the mean square quantization error, i.e.,

$$E\left[(x_i - Q_i)^2\right] = \text{minimum}$$

where $x_i$ is an input to the quantizer that falls between $b_{i-1}$ and $b_i$.
The value of $Q_i$ is determined by setting $Q_i = \bar{x}_i$, where $\bar{x}_i$
is the mean of $x$ for inputs contained between $b_{i-1}$ and $b_i$ in
quantization level i:

$$Q_i = \frac{\int_{b_{i-1}}^{b_i} x\, f(x)\,dx}{\int_{b_{i-1}}^{b_i} f(x)\,dx} \qquad \text{(B)}$$
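
For the exponential density the centroid has a closed form, which the Python sketch below evaluates for the 16-level quantizer. The value of $\alpha$ is taken from paragraph 5.6, and the helper names are assumptions made for this illustration rather than part of the specification.

```python
import math

ALPHA = 6 * math.log(2) / 2**8             # about 0.0162, see paragraph 5.6


def boundaries(k, alpha=ALPHA):
    """Positive decision boundaries for a 2^k-level quantizer."""
    half = 2 ** (k - 1)
    return [math.log(half / (half - i)) / alpha for i in range(half)]


def centroid(lo, hi, alpha=ALPHA):
    """Mean of x under an exponential density restricted to [lo, hi)."""
    if math.isinf(hi):
        return lo + 1.0 / alpha            # open-ended outermost level
    w_lo, w_hi = math.exp(-alpha * lo), math.exp(-alpha * hi)
    numerator = (lo + 1 / alpha) * w_lo - (hi + 1 / alpha) * w_hi
    return numerator / (w_lo - w_hi)


edges = boundaries(4) + [math.inf]
values = [round(centroid(lo, hi)) for lo, hi in zip(edges, edges[1:])]
print(values)
```

Rounded to integers, these centroids reproduce the quantizer values listed for 16 levels in Table 1.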


5.6  The  DPCM  should  have  an  input  dynamic  range  of  +2  .  This  implies  that 
the difference between coefficients has a dynamic range of $\pm 2^8$. The input
gain of the DPCM should be set such that the probability of the magnitude of
the coefficient difference exceeding $2^8$ is $1/2^P$, where $P = k_{max}$ (the
largest number of bits that can be used in the quantizer), i.e.,

$$2\int_{2^8}^{\infty} f(x)\,dx = e^{-\alpha \cdot 2^8} = \frac{1}{2^P},
\qquad \alpha \cdot 2^8 = \ln 2^P$$

For the DPCM required in this specification, P = 6. Thus $\alpha$ = 0.0162. The
quantization boundaries and mean values can be determined from equations A and B.
The values are given in Table 1. It should be noted that to determine into
which quantization level a difference falls, for k bits of quantization (k < 6),
it is necessary only to truncate the six bit quantization number to k bits.
This will become clear by examining Table 1.
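
The truncation property follows from the boundary formula: every boundary of a k-bit quantizer is also a boundary of the 6-bit quantizer. The Python sketch below checks this for integer magnitudes. It assumes that the leading transmitted bit is the sign bit, so only the magnitude bits are shifted; the function names are illustrative.

```python
import math
from bisect import bisect_right

P_MAX = 6
ALPHA = P_MAX * math.log(2) / 2**8         # gain setting from paragraph 5.6


def boundaries(k):
    """Positive decision boundaries for a 2^k-level quantizer."""
    half = 2 ** (k - 1)
    return [math.log(half / (half - i)) / ALPHA for i in range(half)]


def level_index(magnitude, k):
    """Index of the positive level of the k-bit quantizer holding magnitude."""
    return bisect_right(boundaries(k), magnitude) - 1


def truncate(index6, k):
    """Drop the low-order bits of the 6-bit level number to leave k bits."""
    return index6 >> (P_MAX - k)


magnitude = 100
codes = {k: level_index(magnitude, k) for k in range(2, P_MAX + 1)}
print(ALPHA, codes)
```

For every k the truncated 6-bit level number equals the level found by quantizing directly with the k-bit boundaries.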


QUANTIZER POSITIVE VALUES FOR 64 LEVELS

QUANTIZER   QUANTIZER   TRANSMITTED
BOUND       VALUE       BITS

0           1           000000
2           3           000001
4           5           000010
6           7           000011
8           9           000100
10          11          000101
13          14          000110
15          16          000111
18          19          001000
20          21          001001
23          24          001010
26          27          001011
29          30          001100
32          33          001101
35          37          001110
39          41          001111
43          45          010000
47          49          010001
51          53          010010
55          57          010011
60          63          010100
66          69          010101
72          75          010110
79          81          010111
85          89          011000
94          98          011001
103         108         011010
114         121         011011
128         137         011100
146         158         011101
171         190         011110
213         275         011111

TABLE 1. DPCM QUANTIZER VALUES

QUANTIZER POSITIVE VALUES FOR 32 LEVELS

QUANTIZER   QUANTIZER   TRANSMITTED
BOUND       VALUE       BITS

0           2           00000
4           6           00001
8           10          00010
13          15          00011
18          20          00100
23          26          00101
29          32          00110
35          39          00111
43          47          01000
51          55          01001
60          66          01010
72          78          01011
85          94          01100
103         115         01101
128         147         01110
171         233         01111

TABLE 1. DPCM QUANTIZER VALUES (Cont'd)


QUANTIZER POSITIVE VALUES FOR 16 LEVELS

QUANTIZER   QUANTIZER   TRANSMITTED
BOUND       VALUE       BITS

0           4           0000
8           13          0001
18          23          0010
29          36          0011
43          51          0100
60          72          0101
85          104         0110
128         190         0111

QUANTIZER POSITIVE VALUES FOR 8 LEVELS

QUANTIZER   QUANTIZER   TRANSMITTED
BOUND       VALUE       BITS

0           9           000
18          30          001
43          62          010
85          147         011

TABLE 1. DPCM QUANTIZER VALUES (Cont'd)

QUANTIZER POSITIVE VALUES FOR 4 LEVELS

QUANTIZER   QUANTIZER   TRANSMITTED
BOUND       VALUE       BITS

QUANTIZER POSITIVE VALUES FOR 2 LEVELS

QUANTIZER   QUANTIZER   TRANSMITTED
BOUND       VALUE       BITS

TABLE 1. DPCM QUANTIZER VALUES (Cont'd)

5.7 The inverse DPCM at the ground station is almost identical to the DPCM,
only less hardware is required. The inverse DPCM is shown below.


5.8 Two different designs for the DPCM have been built at NUC. The first is
a hybrid analog/digital system, shown schematically below.

[Figure: schematic of the hybrid analog/digital DPCM, including DELAY and D/A blocks]

This design has the advantage that the analog to digital converter is nonlinear.
It has 8 bits of dynamic range but only 4 bits of resolution. At low bit rates
(i.e., large bandwidth reduction) 4 bits or 16 quantization levels are all that
are required. Therefore, a very fast A/D and DPCM can be implemented using 16
comparators. This DPCM design has two disadvantages. 1) The analog differential
amplifier and D/A converter at the input can introduce noise into the
system. 2) For high data rates (low compression) it is desirable to have
more than 16 quantization levels. At 1.6 Mbits/sec, 6 bits or 64 levels
are desirable for the first few low frequency coefficients.


5.9 A second, all digital DPCM has been designed and is currently being
implemented. It is intended to be used with a cosine transform that reduces the
processing speed from 4.8 MHz to 1.2 MHz. A small, relatively low speed 8 bit
A/D converter can then be used directly on the output of the DCT. All DPCM
arithmetic operations are then performed digitally. This approach uses somewhat
less hardware since 1) it is running slower and 2) only one A/D is required
rather than both an A/D and D/A. The disadvantage of the design is that the
coefficients must enter the DPCM at a low rate (1.2 MHz) or a 4.8 MHz A/D must be used.

5.10 The block diagram of the all digital DPCM is shown below.


In both designs the quantization occurs in two steps. 1) The input data is
assigned to one of $2^{k_{max}}$ quantization levels. 2) The $k_{max}$ bits are then
truncated to k bits as dictated by the ROM that stores the number of bits for
each coefficient. These k bits and the bits for the number of bits per
coefficient address a ROM that outputs the appropriate quantization value. The data
for the ROMs for a 6 bit quantizer are given in Table 1. The input data rate
for the DPCM is determined by the DCT. It is desirable that the DCT have an
output rate less than 4.8 MHz to reduce the speed at which the DPCM must
operate. The output data rate for the DPCM is fixed by the modem interface
requirements. The DPCM output is serial at four fixed rates: 200, 400, 800,
and 1600 kilobits/sec. The required data rates translate to, respectively, 13, 26, 52,
and 104 bits per TV line. The appropriate data rate is obtained by choosing the
number of bits used to quantize each coefficient such that the sum equals the
required number of bits per TV line. The optimum bit assignments for minimizing
mean square error have been determined by Habibi at U.S.C. and are given in the
table below. These bit assignments are not necessarily best, and the DPCM design
should allow bit assignment changes to be made easily (i.e., by changing a ROM).

The  DPCM  must  be  able  to  switch  between  the  four  bit  rates  on  command  from  the 
ground.  Two  input  bits,  x  and  y,  are  supplied  by  the  command  and  control  link 
that  specify  the  bit  rate,  as  shown  below. 


X   Y   Bit Rate (kilobits/sec)

0   0   1600
0   1   800
1   0   400
1   1   200

5.11 The output data from the DPCM is serial and the bits are presented to
the modem synchronously with a data clock at the appropriate rate (200 KHz,
400 KHz, etc.). The data clock is supplied by the modem. The data is generated
by the DPCM at a nonuniform rate and a small buffer is required between the
DPCM and the modem.

5.12 The interface from the DPCM to the modem is currently implemented and is
described below. (See schematics at the end of this report.)


5.13 As mentioned earlier, when fewer than 6 bits are used to quantize a
coefficient, only the high order bits of the 6 bit word are retained. The
interface is a double buffered arrangement where all thirty-two 6 bit
quantization levels are stored. Only selected bits are read from the memory for
transmission to the modem. A read only memory is addressed sequentially by
the data clock from the modem. A ROM then reads the appropriate bits from
one buffer, while new data is stored in the other buffer. At the end of each
TV line the roles of the buffers are interchanged. The bits that should be
truncated because fewer than 6 bits are used for a coefficient are not addressed
by the ROM and therefore not transmitted. The order in which the ROM addresses
the bits is such that all the bits that must be sent at the 200 KHz data rate
are transmitted first. Next, all bits required for the 400 KHz data rate (but
not for the 200 KHz rate) are transmitted. This pattern continues until all
bits have been accounted for. Since the ROM address is incremented by the
data clock from the modem and reset to zero at the beginning of each TV line,
the number of bits sent is determined only by the data clock frequency, and the
correct bits are always transmitted.
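
The read ordering described above can be sketched in Python as follows. The example is deliberately small: the four coefficients, their 6 bit words, and the bit assignments per rate are hypothetical values chosen for illustration, not the assignments determined by Habibi, and the function names are not taken from the hardware design.

```python
WORD_BITS = 6

# Hypothetical bit assignments per coefficient, lowest data rate first.
ALLOCATIONS = [
    [2, 0, 0, 0],
    [3, 1, 0, 0],
    [4, 2, 1, 1],
    [6, 5, 3, 2],
]
WORDS = [0b101101, 0b011010, 0b110001, 0b000111]   # hypothetical buffer


def rom_order(allocations):
    """(coefficient, bit) read order; bit 0 is the most significant bit."""
    order, sent = [], [0] * len(allocations[0])
    for alloc in allocations:
        for coeff, nbits in enumerate(alloc):
            order.extend((coeff, bit) for bit in range(sent[coeff], nbits))
            sent[coeff] = max(sent[coeff], nbits)
    return order


def transmit(words, order, clocks):
    """Bits read out for one TV line when the modem supplies `clocks` pulses."""
    return [(words[c] >> (WORD_BITS - 1 - b)) & 1 for c, b in order[:clocks]]


def receive(bits, order, n_coeffs):
    """Ground-station side: place each received bit back into its word."""
    words = [0] * n_coeffs
    for (c, b), bit in zip(order, bits):
        words[c] |= bit << (WORD_BITS - 1 - b)
    return words


def truncated(words, alloc):
    """Reference result: keep only the high-order alloc[c] bits of each word."""
    return [w & (((1 << n) - 1) << (WORD_BITS - n)) for w, n in zip(words, alloc)]


ORDER = rom_order(ALLOCATIONS)
for alloc in ALLOCATIONS:
    clocks = sum(alloc)
    print(clocks, receive(transmit(WORDS, ORDER, clocks), ORDER, len(WORDS)))
```

Because the bits for each lower rate form a prefix of the read order, the number of clock pulses alone selects the rate, as the paragraph states.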

5.14 The interface at the ground station is the reverse of the above process.
A ROM stores the buffer address in which each input bit is to be stored. The
quantized data is then reconstructed in one buffer while the other buffer feeds
the inverse DPCM. Again, the role of the two buffers is changed at the
beginning of each TV line.

5.15 This represents only one of many possible interfaces that could be designed
and is probably not a minimum hardware implementation. An alternative approach
is to use a first-in-first-out memory (FIFO) to buffer the data between the
DPCM and the modem. This requires an extremely fast FIFO but could significantly
reduce parts count.


5.16 An all digital DPCM is the more desirable implementation because it
allows for 6 bit quantization with minimum hardware. However, it does need
either a fast A/D or, preferably, a slow input rate from the DCT. The drawings
for a possible digital implementation that has not been fully tested are given
at the end of this report. Schematics for the hybrid analog/digital DPCM are
available from NUC if desired.


Section  VI.  Frame  Store  Memory 


6.1 A frame store memory (FSM) is required in the ground station to reconstruct
the picture from the 8 stripes that are transmitted per frame. The frame store
memory as currently implemented uses a commercial 32k x 18 bit solid state
memory (specs in appendix) and stores the pixels sequentially, 3 pixels per 18
bit word. All reconstructed picture elements are buffered in the FSM and read
out at EIA standard TV rates. Because of the 8 to 1 reduction in frame rate,
each picture element is displayed 8 times before being updated.

6.2 The basic operation of the frame store memory is as follows. The TV sync
in the plane and on the ground are locked together as discussed in the timing and
synchronization section. During the active part of each TV line, 32 picture
elements (pixels) are reconstructed at the ground and buffered. Simultaneously,
256 pixels are read from the main memory and displayed. During the horizontal
retrace time, the 32 new pixels are stored in the main memory. Since the camera
and the display are sync-locked, the new pixels are stored in the same line as
was just displayed. The stripe (1/8 line) in which the pixels are stored is
determined by the stripe counter on the ground, which is also synchronous with
the one in the plane.

6.3 There is no distinction made anywhere in the bandwidth compression system
between the even and odd fields of the TV frame. The even and odd fields are
assumed to be the same, and the frame store memory stores only one field, i.e.,
256 pixels x 262.5 lines. It is possible by doubling the FSM size to store
both fields and thus increase the number of vertical lines to 525. There is
currently, however, no desire to do this.

6.4 There are complications to the operation of the frame store memory due to
the fact that 3 pixels are stored in each main memory word. This means that 32
pixels require 10 2/3 words. Since the line is divided into eight 32-pixel
stripes, some stripes do not start or end on even word boundaries in the main
memory. A problem arises when only part of a main memory word is to be updated
(i.e. 6 or 9 bits). Because that part of the word not updated is destroyed,
one must compensate by reading from the main memory the pixels that could be
destroyed in the update process (i.e. the first and last words of the block).
The new pixels are merged with the data from the memory and rewritten, thus
saving the pixels that are not updated.
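
The read, merge, and rewrite sequence can be sketched in Python as below. Several details are assumptions made for the illustration and are not stated in the report: pixels are taken to be 6 bits wide (three per 18 bit word), addressed as one continuous sequence across lines, and packed with the first pixel in the low-order bits. The sketch reads every word of the block, where the hardware only needs to read the first and last.

```python
PIXEL_BITS = 6
PIXELS_PER_WORD = 3
PIXELS_PER_LINE = 256
STRIPE_PIXELS = 32
PIXEL_MASK = (1 << PIXEL_BITS) - 1


def read_pixel(memory, index):
    """Extract one 6 bit pixel from its 18 bit word."""
    word, slot = divmod(index, PIXELS_PER_WORD)
    return (memory[word] >> (slot * PIXEL_BITS)) & PIXEL_MASK


def write_stripe(memory, line, stripe, new_pixels):
    """Read-modify-write of one stripe; returns the number of words touched."""
    start = line * PIXELS_PER_LINE + stripe * STRIPE_PIXELS
    first_word = start // PIXELS_PER_WORD
    last_word = (start + len(new_pixels) - 1) // PIXELS_PER_WORD
    block = memory[first_word:last_word + 1]           # read affected words
    for offset, pixel in enumerate(new_pixels):
        word, slot = divmod(start + offset, PIXELS_PER_WORD)
        shift = slot * PIXEL_BITS
        i = word - first_word
        # Clear the old pixel field, then merge in the new pixel.
        block[i] = (block[i] & ~(PIXEL_MASK << shift)) | ((pixel & PIXEL_MASK) << shift)
    memory[first_word:last_word + 1] = block           # rewrite merged words
    return last_word - first_word + 1


# Two lines of memory, every pixel preset to 63 so that damage would show.
n_words = -(-2 * PIXELS_PER_LINE // PIXELS_PER_WORD)
memory = [0x3FFFF] * n_words
words_touched = write_stripe(memory, 0, 1, list(range(STRIPE_PIXELS)))
print(words_touched, read_pixel(memory, 31), read_pixel(memory, 32), read_pixel(memory, 64))
```

Stripe 1 starts in the middle of a word, so the pixels on either side of it share words with the update; the merge leaves them unchanged.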

6.5 The current implementation of the FSM has only 256 lines per frame and is
not set up to accept 262.5 lines per field video. The modification to the
existing design to incorporate the 262.5 line video should be very minor and
NUC can provide guidance in this area.


[Figure: DPCM block diagram]

[Figure: digital DCT algorithm]


APPENDIX  B 



COMBINED  SPATIAL  AND  TEMPORAL  CODING 
OF  DIGITAL  IMAGE  SEQUENCES* 


John  A.  Roese 
Naval  Undersea  Center 
San  Diego,  California  92132 

Guner  S.  Robinson 

Image  Processing  Institute,  University  of  Southern  California 
Los  Angeles,  California  90007 


Abstract 


Interframe  coding  of  television  images  encompasses  techniques  which  make  use  of  corre¬ 
lations  between  pixel  amplitudes  in  successive  frames.  Intraframe  coding  techniques  that 
exploit  spatial  correlations  can,  in  principle,  be  extended  to  include  correlations  in  the 
temporal  domain. 

In  this  paper,  successive  frames  of  digital  images  are  coded  using  two-dimensional  spa¬ 
tial  transforms  combined  with  DPCM  in  the  temporal  domain.  Specific  transform  techniques 
investigated  are  the  two-dimensional  cosine  and  Fourier  transforms.  Due  to  DPCM  encoding 
in  the  temporal  domain,  the  hybrid  transform/DPCM  encoders  require  storage  of  only  the 
single  previous  frame  of  data. 

Hardware  implementation  of  the  Fourier  transform  involves  manipulation  of  complex  num¬ 
bers  where  the  cosine  transform  does  not.  However,  the  Fourier  transform  is  attractive 
because  frame-to-frame  motion  compensation  can  be  introduced  directly  in  the  phase  plane 
by  application  of  appropriate  phase  correction  factors. 

Results  are  presented  in  terms  of  coding  efficiency,  storage  requirements,  computational 
complexity,  and  sensitivity  to  channel  noise. 

Introduction 

In  the  design  of  image  coding  systems  for  digital  communications  channels,  the  primary 
objective  is  to  minimize  the  number  of  code  bits  required  to  reconstruct  the  image  at  the 
receiver.  Reduction  in  the  number  of  code  bits  transmitted  results  in  reduced  channel 
bandwidth,  more  rapid  transmission  of  digital  images,  and  lower  transmitter  power  require¬ 
ments. 

Efficient  coding  of  the  digital  images  is  accomplished  by  removal  of  statistical  redun¬ 
dancies  that  exist  within  the  image.  Transform,  predictive,  and  hybrid  transform/predic- 
tive  image  coding  techniques  have  been  developed  to  exploit  intraframe  spatial  image  redun¬ 
dancies.  This  paper  describes  efforts  to  extend  these  image  coding  techniques  to  coding  of 
time-sequences  of  digital  images  transmitted  over  a  digital  communications  channel.  The 
emphasis  has  been  directed  towards  definition  of  an  image  coding  system  that  exploits 
temporal  as  well  as  spatial  image  redundancies. 


Intraframe  Image  Coding 


The  primary  techniques  that  have  been  developed  for  intraframe  image  coding  in  the  spa¬ 
tial  domain  are  transform,  linear  predictive,  and  hybrid  transform/linear  predictive  tech¬ 
niques.  Operational  descriptions  of  these  coding  methods  are  given  below. 


Transform  Image  Coding 


The  basic  premise  of  transform  image  coding  is  that  the  transform  domain  representation 
of  an  image  has  an  energy  distribution  that  is  more  compact  and  therefore  easier  to  effi¬ 
ciently  code  than  the  spatial  domain  version.  In  transform  coding  systems,  a  one-  or  two- 
dimensional  linear  transform  of  an  image  line  segment  or  block  is  performed  at  the  coder. 
The  transform  coefficient  statistics  are  computed  prior  to  quantization  and  coding  for 
transmission. After decoding at the receiver, an inverse transform is taken to obtain a
reconstructed image. Transforms that have proven useful for this application include
Fourier, cosine, Hadamard, Slant, and Karhunen-Loeve. It should be noted that the
two transforms can be different for the two spatial directions.


Two-dimensional  transforms  have  the  inherent  disadvantage  of  requiring  an  intermediate 
memory  to  store  the  transform  coefficients  computed  in  one  direction  while  the  transform 
is  being  computed  in  the  other  direction.  Transform  coding  techniques  have  been  explored 
extensively  both  theoretically  and  by  simulation.  It  has  been  shown  that  significant  bit 
rate  reductions  can  be  achieved  in  many  applications  with  minimal  image  degradation. 


*To be published in SPIE Conference Proceedings (Aug. 1975).
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Simulation  results  indicate  that  a  bit  rate  reduction  to  1.5 

making the transform coding system adaptive.

Predictive  Image  Coding 

The high degree of correlation between a given pixel value and its nearest neighbors
allows an image to be efficiently represented by coding only the difference between each
pixel value and its predicted value. The predicted pixel value is based on previously
scanned pixel values. In a differential pulse code modulation (DPCM) system, the
predicted value of each pixel is subtracted from its actual value and this difference is
quantized and transmitted. Quantization of the data is [...] the probability density of
the difference signal. Thus, coding by [...] requires a knowledge of the data statistics.
The basic operation of the DPCM coder is to generate an uncorrelated signal which is then
encoded by a [...] transmission. At the receiver, the quantized difference signal is
combined with its predicted value to form the reconstructed pixel value. Basic DPCM image
coding [...] quality at about 3 bits/pixel. Adaptive systems, in which parameters of the
quantizer and predictor adapt to the image content, require about 2 bits/pixel.

Hybrid  Image  Coding 

Analysis of transform and predictive coding techniques has shown that both techniques
possess attractive characteristics and certain limitations. Transform coding techniques
achieve good performance at low bit rates, show less sensitivity to data statistics
(picture-to-picture variations), and are less vulnerable to channel noise. Predictive coding
systems are superior to transform techniques in terms of equipment complexity, memory
requirements, and performance at high bit rates. Some limitations of predictive systems
are their sensitivity to data statistics and to channel error.

An intraframe hybrid coding system that combines the attractive features of both
transform and predictive coding systems has been devised.[6] In this system, a one-dimensional
transform is followed by DPCM linear predictive coding of the transform domain
coefficients. At the receiver, the transform coefficients are decoded and a replica of the
original image is reconstructed by an inverse transform.

Hybrid  Interframe  Image  Coding 

Interframe coding of digital image sequences encompasses those techniques which make
use of the high correlation that exists between pixel amplitudes in successive frames.
Intraframe coding techniques that exploit spatial correlations can, in principle, be
extended to include correlations in the temporal domain. Previous research in the area of
three-dimensional Fourier and Hadamard transformations has indicated that bit rates can be
reduced by a factor of five by incorporating correlations in the temporal direction.
However, three-dimensional transform systems are unattractive as they use large amounts of
data storage and require excessive computations.

To alleviate the problems associated with three-dimensional transform systems, new
hybrid (two-dimensional transform)/DPCM image coding systems have been developed.
These systems utilize both spatial and temporal correlations while greatly reducing
memory storage and computational requirements. The block diagram for a hybrid
(two-dimensional transform)/DPCM system is shown as figure 1. In the present
system, either a two-dimensional cosine or Fourier transformation is performed on 16 x 16
subblocks. DPCM linear predictive coding in the temporal domain is then applied to the
transform coefficients of each subblock. For notational convenience, the hybrid
interframe coders employing two-dimensional Fourier transforms will be denoted as FFD and those
using two-dimensional cosine transforms as CCD. The FFD and CCD coders are adaptive in
the sense that statistics of the transform coefficient differences of each subblock are
computed prior to encoding the transform coefficients in the temporal direction by
parallel DPCM coders. At the receiver, the transmitted transform coefficients are decoded and
a replica of each frame is reconstructed by the appropriate inverse two-dimensional
transformation. These systems require only a single frame of storage and involve significantly
less memory and fewer computations than three-dimensional transform coding techniques.

Fig. 1. Hybrid (two-dimensional transform)/DPCM coder.

Operational  Modes 

At least three operational modes have been identified for the hybrid interframe coding
systems. These operational modes depend on the initial conditions assumed for the
predictive coder. The initial conditions are:

a. No a priori information available at the receiver,

b. Limited information (such as mean, variance and temporal correlations based on a
statistical model) available at the receiver, and

c. First frame available at the receiver.

In the no a priori information available case, several frames are required for the
hybrid coder to settle. However, it has been experimentally verified that in the
remaining two cases, nearly stable coder performance is achieved within the first 4 to 6 frames.
From operational considerations, the third set of initial conditions is the most realistic
as periodic full frame updating will be required to eliminate the cumulative effects due
to channel noise.

Mathematical  Formulation 

Let f(x,y) denote a two-dimensional array of intensity values on an NxN subblock of a
digital television image of size MxM. Typical values for M and N are 256 and 16,
respectively. Let F(u,v) be the two-dimensional array obtained by taking the two-dimensional
transform of f(x,y). In the case of the two-dimensional discrete Fourier transform, the
expressions relating f(x,y) and F(u,v) are

where N is the size of the square subblock, f(x,y) is the image intensity function in the
spatial domain, x and y are spatial coordinates, F(u,v) is the Fourier transform, and u
and v are spatial frequencies.

For image processing applications, f(x,y) is a positive real function representing
brightness of the spatial sample. The two-dimensional Fourier transform of a real-valued
function has the following conjugate symmetry property:

$$F^*(u,v) = F(N-u,\, N-v), \qquad u, v = 1, 2, \ldots, \frac{N}{2} - 1$$

The Fourier transform consists of $2N^2$ components, i.e., the real and imaginary or
magnitude and phase components of each spatial frequency. However, as a result of the
conjugate symmetry properties mentioned above, only $N^2$ components are required to completely
define the Fourier transform.
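
The conjugate symmetry property is easy to confirm numerically. The Python sketch below uses a direct (slow) evaluation of the two-dimensional DFT on a small real-valued block; the 1/N normalization and the sign of the exponent are assumptions made for the sketch, and neither affects the symmetry being checked. Indices are taken modulo N so that the check covers every frequency.

```python
import cmath


def dft2(f):
    """Direct two-dimensional DFT of an N x N block (assumed 1/N scaling)."""
    n = len(f)
    return [[sum(f[x][y] * cmath.exp(-2j * cmath.pi * (u * x + v * y) / n)
                 for x in range(n) for y in range(n)) / n
             for v in range(n)]
            for u in range(n)]


N = 4
# Arbitrary real-valued test block.
block = [[(3 * x + 5 * y * y + x * y) % 17 for y in range(N)] for x in range(N)]
F = dft2(block)
max_mismatch = max(abs(F[u][v].conjugate() - F[(N - u) % N][(N - v) % N])
                   for u in range(N) for v in range(N))
print(max_mismatch)
```

The mismatch is at the level of floating point rounding, which is why only about half of the complex coefficients need to be coded.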

In the case of the Fourier transform, a shift in the spatial-domain variables results
in a multiplication of the Fourier transform of the unshifted image by a phase factor.
If the input image $f(x,y,t_1)$ is shifted by the amount $x_0$ in the x-direction and $y_0$ in the
y-direction between times $t_1$ and $t_2$, then the Fourier transform of the shifted image is
given by

$$F(u,v,t_2) = F(u,v,t_1)\, \exp\left[-\frac{2\pi i}{N}\,(u x_0 + v y_0)\right]$$


This shifting property is expected to be useful in detecting and compensating for
effects of motion between frames since many types of motions such as panned motion produce
significant changes in phase components and small changes in amplitude components. Thus,
compensation for platform motion may be implemented directly in the array of phase
components by application of appropriate phase correction factors.
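
A small Python sketch can illustrate the shifting property. It assumes a circular shift of the block (a real camera pan also brings new picture content in at the edges, which this idealization ignores), a forward transform with a negative exponent, and 1/N scaling; the test block and shift amounts are arbitrary.

```python
import cmath


def dft2(f):
    """Direct two-dimensional DFT of an N x N block (assumed 1/N scaling)."""
    n = len(f)
    return [[sum(f[x][y] * cmath.exp(-2j * cmath.pi * (u * x + v * y) / n)
                 for x in range(n) for y in range(n)) / n
             for v in range(n)]
            for u in range(n)]


def circular_shift(f, x0, y0):
    """Move the block content by (x0, y0) with wrap-around."""
    n = len(f)
    return [[f[(x - x0) % n][(y - y0) % n] for y in range(n)] for x in range(n)]


N, x0, y0 = 4, 1, 2
frame1 = [[(7 * x + 3 * y + x * y) % 11 for y in range(N)] for x in range(N)]
frame2 = circular_shift(frame1, x0, y0)
F1, F2 = dft2(frame1), dft2(frame2)

# Predicted transform of the shifted frame: original times a phase factor.
phase_error = max(
    abs(F2[u][v] - F1[u][v] * cmath.exp(-2j * cmath.pi * (u * x0 + v * y0) / N))
    for u in range(N) for v in range(N))
magnitude_change = max(abs(abs(F2[u][v]) - abs(F1[u][v]))
                       for u in range(N) for v in range(N))
print(phase_error, magnitude_change)
```

Under these assumptions the shift changes only the phase of each coefficient, so a known displacement could be removed by multiplying by the conjugate phase factor.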

The two-dimensional Fourier transform F(u,v) of a spatial signal function f(x,y) is
separable, i.e., it can be computed as two sequential one-dimensional transforms since the
Fourier kernel,

$$\exp\left[-\frac{2\pi i}{N}\,(ux + vy)\right],$$

is separable and symmetric. Thus, the basic one-dimensional discrete Fourier transform
that must be performed is

$$F(u) = \frac{1}{N} \sum_{x=0}^{N-1} f(x)\, \exp\left[-\frac{2\pi i\, u x}{N}\right]$$

for u = 0, 1, ..., N-1.

In the case of the discrete cosine transform [3], the one-dimensional transform is

$$F(u) = \frac{2}{N} \sum_{x=0}^{N-1} f(x)\, \cos\left[\frac{(2x+1)\,u\pi}{2N}\right] \qquad (5)$$

for u = 0, 1, ..., N-1. The form of eq. (5) differs from that of reference [3] only by a
normalization constant. The cosine transform is also separable and a two-dimensional
discrete cosine transform of an NxN subblock results in $N^2$ real coefficients.
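
Separability of the cosine transform can be demonstrated with a few lines of Python. The sketch applies a one-dimensional cosine transform along rows and then along columns and compares the result with a direct double sum. The 2/N normalization is an assumption; as the text notes, conventions differ by a constant, and the constant does not affect separability.

```python
import math


def dct_1d(f):
    """One-dimensional cosine transform with an assumed 2/N scale factor."""
    n = len(f)
    return [(2.0 / n) * sum(f[x] * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                            for x in range(n))
            for u in range(n)]


def dct_2d(block):
    """Two-dimensional transform as two passes of the 1-D transform."""
    rows = [dct_1d(row) for row in block]              # transform along y
    cols = [dct_1d(list(col)) for col in zip(*rows)]   # transform along x
    return [list(row) for row in zip(*cols)]           # back to [u][v] order


def dct_2d_direct(block):
    """Direct double sum, used only to check the separable version."""
    n = len(block)
    return [[(2.0 / n) ** 2 * sum(
                block[x][y]
                * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                * math.cos((2 * y + 1) * v * math.pi / (2 * n))
                for x in range(n) for y in range(n))
             for v in range(n)]
            for u in range(n)]


N = 4
block = [[float((5 * x + 2 * y + x * y) % 13) for y in range(N)] for x in range(N)]
separable, direct = dct_2d(block), dct_2d_direct(block)
max_difference = max(abs(separable[u][v] - direct[u][v])
                     for u in range(N) for v in range(N))
flat = dct_2d([[1.0] * N for _ in range(N)])
print(max_difference, flat[0][0])
```

A constant block produces a single nonzero coefficient, which is the energy compaction that makes the transform attractive for coding.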

Quantization 

Experimental evidence derived from transmission of a typical "head and shoulders"
picture telephone scene has shown that the frame difference signal has a probability density
closely approximated by a double sided exponential function.[10] The optimum minimum
mean square error quantizer for this distribution has been found to be a uniform quantizer
combined with a companding of the frame difference signal.

Since the variances of the transform domain coefficient differences are different, it
is necessary to use different quantizer parameters for each one. Each coefficient
difference signal is allocated a number of bits proportional to its estimated variance in
accordance with an optimum bit assignment algorithm.

Fidelity  Criteria 

In figure 1, differences between input signal $f(x,y,t)$ and output signal $\hat{f}(x,y,t)$
are due to two sources: quantization errors and errors due to channel noise. To
evaluate coding efficiency of the hybrid encoders, two objective criteria were used. The first
criterion, NMSE, is a measure of the mean square error between $f(x,y,t)$ and $\hat{f}(x,y,t)$
averaged over an entire frame of size MxM. Normalization is achieved by dividing the mean
square error by the mean signal energy within the frame.

$$\mathrm{NMSE} = \frac{\sum_{x=0}^{M-1} \sum_{y=0}^{M-1} \left[f(x,y,t) - \hat{f}(x,y,t)\right]^2}
{\sum_{x=0}^{M-1} \sum_{y=0}^{M-1} \left[f(x,y,t)\right]^2}$$
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The  second  criterion,  SNR,  measures  the  ratio  of  peak-to-peak  signal  to  RMS  noise: 
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Figures  2  and  3  are  graphs  illustrating  the  coding  efficiency  of  the  hybrid  FFD  and  CCD 
coders  at  various  bit  rates  in  the  interval  0.1  to  1.0  bits/pixel/frame.  To  perform  this 
series  of  experiments,  a  256  x  256  resolution  data  base  consisting  of  16  consecutive 
frames  of  a  24  frames  per  second  (fps)  motion  picture  was  digitized.  Initial  conditions 
assumed  were  that  the  first  frame  was  available  at  the  receiver. 
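
The two fidelity criteria defined above can be sketched in Python as follows. The NMSE follows the verbal definition directly. For the SNR, expressing the ratio in decibels and taking a peak-to-peak signal of 255 (8 bit pixels) are assumptions made for this sketch; the tiny 2 x 2 frames are illustrative data only.

```python
import math


def nmse(original, decoded):
    """Mean square error divided by the mean signal energy of the frame."""
    error = sum((a - b) ** 2
                for row_o, row_d in zip(original, decoded)
                for a, b in zip(row_o, row_d))
    energy = sum(a ** 2 for row in original for a in row)
    return error / energy


def snr_db(original, decoded, peak_to_peak=255.0):
    """Peak-to-peak signal over RMS noise, in dB (assumed form)."""
    count = sum(len(row) for row in original)
    mse = sum((a - b) ** 2
              for row_o, row_d in zip(original, decoded)
              for a, b in zip(row_o, row_d)) / count
    if mse == 0:
        return math.inf                    # identical frames
    return 20 * math.log10(peak_to_peak / math.sqrt(mse))


frame = [[100, 200], [50, 150]]
coded = [[101, 199], [51, 149]]
print(nmse(frame, coded), snr_db(frame, coded))
```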
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Fig.  2.  Fourier/Fourier/DPCM  coder 
at  various  bit  rates. 


Fig. 3. Cosine/cosine/DPCM coder
at various bit rates.


Photographs  of  frame  number  16  after  coding  by  the  FFD  and  CCD  coders  at  average  pixel 
bit  rates  of  1.0,  0.5,  0.25,  and  0.1  are  shown  as  figures  4  and  5.  The  results  shown  in 
figure  4  for  the  FFD  coder  were  obtained  by  coding  the  real  and  imaginary  components  of 
the  Fourier  coefficients  by  assigning  half  of  the  available  bits  to  each  component. 


Fig. 4. FFD coder for frame 16 at 1.0, 0.5, 0.25, and 0.1 bits/pixel/frame.

Fig. 5. CCD coder for frame 16 at 1.0, 0.5, 0.25, and 0.1 bits/pixel/frame.

Noise Immunity


Performance of the FFD and CCD hybrid interframe coders was investigated in the
presence of channel noise. In order to study the effect of channel noise, a binary symmetric
channel was simulated. The channel is assumed to operate on each binary digit
independently, changing each digit from 0 to 1 or from 1 to 0 with probability $P_{ce}$ and leaving the
digit unchanged with probability $1 - P_{ce}$. At the receiver, the encoded picture is
reconstructed from the string of binary digits, including errors, transmitted across the channel.
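
A binary symmetric channel of this kind takes only a few lines to simulate. The Python sketch below is an illustration, not the simulation used in the paper; the random seed, the bit count, and the function name are assumptions.

```python
import random


def binary_symmetric_channel(bits, p_ce, rng):
    """Flip each bit independently with probability p_ce."""
    return [bit ^ int(rng.random() < p_ce) for bit in bits]


rng = random.Random(1975)                  # fixed seed so runs are repeatable
sent = [rng.getrandbits(1) for _ in range(100_000)]
received = binary_symmetric_channel(sent, 1e-3, rng)
error_rate = sum(s != r for s, r in zip(sent, received)) / len(sent)
print(error_rate)
```

In a DPCM decoder each flipped bit corrupts the running prediction, which is why the coded errors persist from frame to frame until the next refresh.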

Degradations due to channel noise probabilities, $P_{ce}$, of zero, $10^{-4}$ and $10^{-3}$ for the
FFD and CCD coders at average bit rates of 1.0 and 0.25 bits/pixel/frame are shown in
figures 6 through 9. The generally monotonically increasing character of these curves
illustrates the fact that once an error has occurred, it tends to propagate in the
temporal direction until corrected by a frame refresh.


Fig. 6. Effects of channel noise for Fourier/Fourier/DPCM coder at
an average 1.0 bits/pixel/frame.

Fig. 7. Effects of channel noise for Fourier/Fourier/DPCM coder at
an average 0.25 bits/pixel/frame.

Fig. 8. Effects of channel noise for cosine/cosine/DPCM coder at an
average 1.0 bits/pixel/frame.

Fig. 9. Effects of channel noise for cosine/cosine/DPCM coder at an
average 0.25 bits/pixel/frame.


Photographs corresponding to average bit rates of 1.0 and 0.25 bits/pixel/frame for the
FFD and CCD coders with channel error probabilities of $10^{-3}$ and 10^[exponent illegible]
are shown in figures 10 and 11.
0.25 bits/pixel/frame

Fig. 11. CCD coder with channel noise.


Bit  Transfer  Rate 


In keeping with the previously mentioned objective of minimizing the number of bits
transmitted while retaining image fidelity, a series of experiments was performed in which
certain bit transfer rates (BTR) across the channel were fixed. The BTR is defined as the
product of average pixel bit rate per frame and frame rate and has units of bits/pixel/sec:

BTR = (bits/pixel/frame) x (frames/sec)   (8)

The available 16 frame test data base was extracted from a 24 fps motion picture
sequence. By employing frame skipping techniques, temporal subsampling was used to
simulate short 12, 8 and 6 fps sequences from the 16 frame test data base.

Average bit rates in the interval 0.083 to 1.333 bits/pixel/frame were used in conjunction
with the four frame rates mentioned above to perform simulations with BTR values of
8, 6, 4 and 2 bits/pixel/sec. The results of these experiments are shown as figures 12
through 15, respectively. For all cases examined, the graphs show that reduced frame rates
produce smaller NMSE values for the individual frames coded. This indicates that
reductions experienced in frame-to-frame correlations due to temporal subsampling are
completely compensated for by the increased number of bits available for coding. However,
subjectively, reduced frame rates tend to result in jerky subject motion. This is most
apparent for rapidly moving objects in the field of view and is of lesser consequence for
slowly changing scenes.
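
The arithmetic behind equation (8) can be checked with a short sketch. It tabulates the per-frame bit rate implied by each BTR and frame rate, and shows one possible frame skipping rule. Which frames were actually retained is not stated in the text, so `skip_frames` (keep every k-th frame starting with frame 1) is an assumption, as are all the names.

```python
FULL_RATE_FPS = 24            # frame rate of the source motion picture
FRAMES = list(range(1, 17))   # the 16 frame test data base, numbered 1..16

def bits_per_pixel_per_frame(btr, fps):
    """Invert equation (8): per-frame bit rate available at a fixed BTR."""
    return btr / fps

def skip_frames(frames, fps, full_rate=FULL_RATE_FPS):
    """Temporal subsampling by keeping every (full_rate // fps)-th frame (assumed rule)."""
    return frames[:: full_rate // fps]

for btr in (8, 6, 4, 2):
    for fps in (24, 12, 8, 6):
        rate = bits_per_pixel_per_frame(btr, fps)
        kept = skip_frames(FRAMES, fps)
        print(f"BTR {btr} bits/pixel/sec at {fps:2d} fps: "
              f"{rate:.3f} bits/pixel/frame, {len(kept)} frames coded")
```

The two extremes of the table, 2 bits/pixel/sec at 24 fps and 8 bits/pixel/sec at 6 fps, reproduce the quoted interval of 0.083 to 1.333 bits/pixel/frame.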
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Fig.  12.  Cosine/cosine/DPCM  coder  at 
bit  transfer  rate  of  8  bits/ 
pixel/sec. 


Fig.  13.  Cosine/cosine/DPCM  coder  at 
bit  transfer  rate  of  6  bits/ 
pixel/sec. 
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Fig.  14.  Cosine/cosine/DPCM  coder  at 
bit  transfer  rate  of  4  bits/ 
pixel/sec. 


Fig.  15.  Cosine/cosine/DPCM  coder  at 
bit  transfer  rate  of  2  bits/ 
pixel/sec. 


Conclusions 

Based  on  theoretical  and  experimental  results  obtained  to  date,  two  main  conclusions 
have  been  reached.  The  first  is  that  exploitation  of  temporal  correlations  in  addition  to 
spatial  correlations  has  been  demonstrated  to  be  a  viable  technique  for  coding  sequences 
of  digital  images.  This  fact  is  demonstrated  by  a  comparison  of  the  average  bit  rates 
required  for  the  interframe  cosine/cosine/DPCM  and  the  existing  intraframe  cosine/DPCM 
coders  to  achieve  the  same  level  of  NMSE  performance.  The  sixteenth  frame  of  the  test 
data  base  was  chosen  for  comparison  and  was  coded  at  an  average  0.25  bits/pixel  by  the 
interframe cosine/cosine/DPCM coder. When using the intraframe cosine/DPCM coder, it was
necessary to code this frame at a bit rate of more than 2 bits/pixel to achieve the same
NMSE.

The second conclusion is that the performance of the hybrid interframe coders
investigated is heavily data dependent. In the case of the 16 frame head and shoulders test
data base, good coding performance was achieved since subject movement was restricted to a
relatively small portion of the image. However, coding performance with a different aerial
data base was degraded from the previous case due principally to platform motion which
caused frame-to-frame pixel amplitude variations across the entire image. Since the
performance of the hybrid interframe coders is dependent on temporal correlation, a reduced
level of performance is to be anticipated for image sequences distorted by motion.
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ABSTRACT 

Transform, differential pulse code modulation (DPCM), and hybrid transform/DPCM image
coding methods are applied to coding successive frames of digital images. These coding
techniques are designed to exploit the inherent spatial and temporal correlations of image
sequences. In transform image coding, subsections of images are transformed into arrays of
nearly uncorrelated coefficients by the use of orthogonal transformations. Bit compression
is achieved by assigning the available bits in proportion to the energy of the transform
coefficients. In DPCM image coding systems, the value of the image sample is predicted and
the difference between the actual and the predicted value is quantized and transmitted.
Hybrid transform/DPCM coding implementations combine the relatively superior low bit rate
performance and channel noise immunity of transform methods with the minimal storage
requirements of DPCM encoders. Interframe coding implementations studied include:
three-dimensional Cosine transforms, hybrid two-dimensional spatial transforms with DPCM in
the temporal direction, e.g., Cosine-Cosine/DPCM and Fourier-Fourier/DPCM, and
three-dimensional DPCM. Results are summarized by evaluating the various implementations
in terms of performance and complexity criteria.

INTRODUCTION 

During the past few years several intraframe digital image coding systems based upon
transform and linear predictive coding concepts have been developed [1,2]. Most of these
systems have achieved bit rate reductions by removal of statistical redundancies within an
image frame combined with the deletion of that part of the spatial image representation
least critical to the human observer. It is known that there is considerable temporal
redundancy between frames of real time imagery systems; also there are psychophysical
limitations to temporal image perception. Exploitation of either property in the past has
been difficult to achieve because of implementation problems. However, there have been
recent technological advances in signal processing circuitry which hold promise for the
practical implementation of digital real time television image coders. Several such systems
are described in this paper along with an analysis of their performance.

INTRAFRAME TRANSFORM AND PREDICTIVE CODING

Transform coding and linear predictive coding are, perhaps, the most widely employed
techniques for intraframe image coding. Operational descriptions of these coding methods
are given below.

*To be published in ICC Conference Proceedings (Jun. 1975).

Transform Coding: In transform coding systems, a one or two-dimensional mathematical
transform of an image line segment or block is performed. The resulting transform
coefficients are then quantized and coded. A bit rate reduction is possible because the
distribution of energy in the transform domain permits more efficient quantization and
coding.

After decoding, an inverse transform is taken to obtain a replica of the image at the
receiver. Transforms that have proven useful include the Fourier, Hadamard, Slant,
Karhunen-Loeve, and Cosine transforms [3-5]. Transform coding techniques have been explored
extensively both theoretically and by simulation. It has been shown that a significant bit
rate reduction can be achieved in many applications with minimal image degradation.
Simulation results indicate that a bit rate reduction to 1.5 bits/pixel can be obtained for
monochrome image transform coding in 16 x 16 pixel blocks, while color images require about
2.0 bits/pixel [6]. The bit rate can be reduced further by making the transform coding
system adaptive.

Predictive Coding: The high degree of correlation between a given pixel value and its
nearest neighbors allows an image to be efficiently represented by coding only the
difference between each pixel value and its predicted value. The predicted pixel value is
based on previously scanned pixel values. In a differential pulse code modulation (DPCM)
system, the predicted value of each pixel is subtracted from its actual value and this
difference is quantized and transmitted. At the receiver, the quantized difference signal
is combined with its predicted value to form the reconstructed pixel value. Basic DPCM
image coding systems provide good quality at about 3 bits/pixel. Adaptive systems, in
which parameters of the quantizer and predictor adapt to the image content, require about
2 bits/pixel.
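
The DPCM loop described above can be illustrated along a single scan line. This is a minimal sketch under simplifying assumptions that are not from the paper: the predictor is just the previously reconstructed pixel, the quantizer is uniform with a fixed step, and the first prediction is zero.

```python
def dpcm_encode(samples, step):
    """Previous-sample DPCM: quantize each prediction error with a uniform step."""
    codes, prediction = [], 0
    for s in samples:
        q = round((s - prediction) / step)   # quantizer index that is transmitted
        codes.append(q)
        prediction += q * step               # track what the receiver will rebuild
    return codes

def dpcm_decode(codes, step):
    """Add each dequantized difference to the running prediction."""
    out, prediction = [], 0
    for q in codes:
        prediction += q * step
        out.append(prediction)
    return out

line = [100, 102, 105, 104, 110, 120, 119, 118]   # made-up pixel values
codes = dpcm_encode(line, step=4)
recon = dpcm_decode(codes, step=4)
print("transmitted indices:", codes)
print("reconstruction:     ", recon)
print("largest error:      ", max(abs(a - b) for a, b in zip(line, recon)))
```

The encoder predicts from its own reconstruction rather than from the original samples, so quantization errors do not accumulate and each reconstructed sample stays within half a step of the original.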

INTERFRAME  TRANSFORM  CODING 

In  interframe  transform  coding,  a  three-dimen¬ 
sional  unitary  transform  is  performed  on  the  data. 

Let f(x, y, t) denote a three-dimensional array of amplitude values for each frame of a
digital image. Also, let F(u, v, w) be the three-dimensional array obtained by taking the
three-dimensional transform in the (x, y, t) domain. If the size of the three-dimensional
array is $N_1 \times N_2 \times N_3$, then such a transform coder can be described in a
general form as

$$F(u,v,w) = \sum_{t=0}^{N_3-1} \sum_{y=0}^{N_2-1} \sum_{x=0}^{N_1-1} f(x,y,t)\,\phi(u,v,w,x,y,t)$$

$$f(x,y,t) = \sum_{w=0}^{N_3-1} \sum_{v=0}^{N_2-1} \sum_{u=0}^{N_1-1} F(u,v,w)\,\phi^{-1}(u,v,w,x,y,t) \tag{1}$$

where x and y are spatial coordinates, t is the time coordinate, u, v, w are the transform
domain coordinates, and $\phi(u,v,w,x,y,t)$ represents a set of three-dimensional basis
vectors. For the three-dimensional discrete Fourier transform, eq. (1) has the form

$$F(u,v,w) = \frac{1}{N_1 N_2 N_3} \sum_{t=0}^{N_3-1} \sum_{y=0}^{N_2-1} \sum_{x=0}^{N_1-1} f(x,y,t)\,\exp\left\{-2\pi i\left(\frac{ux}{N_1} + \frac{vy}{N_2} + \frac{wt}{N_3}\right)\right\}$$

$$f(x,y,t) = \sum_{w=0}^{N_3-1} \sum_{v=0}^{N_2-1} \sum_{u=0}^{N_1-1} F(u,v,w)\,\exp\left\{2\pi i\left(\frac{ux}{N_1} + \frac{vy}{N_2} + \frac{wt}{N_3}\right)\right\} \tag{2}$$

Since the Fourier kernel

$$\exp\left\{-2\pi i\left(\frac{ux}{N_1} + \frac{vy}{N_2} + \frac{wt}{N_3}\right)\right\}$$

is separable, the three-dimensional transform can be computed as a one-dimensional
transform in the temporal direction followed by a two-dimensional transform in the spatial
domain. The two-dimensional spatial transform can, in turn, be computed as a
one-dimensional transform along the rows followed by a one-dimensional transform along the
columns. Thus, the basic one-dimensional transform that must be performed is

$$F(u) = \frac{1}{N} \sum_{x=0}^{N-1} f(x)\,\exp\left(-\frac{2\pi i}{N}\,ux\right) \tag{3}$$

for u = 0, 1, ..., N-1. In the case of the discrete Cosine transform [7], the
one-dimensional transform is

$$F(u) = c(u) \sum_{x=0}^{N-1} f(x)\,\cos\left[\frac{(2x+1)u\pi}{2N}\right] \tag{4}$$

for u = 0, 1, ..., N-1, where c(u) denotes the normalization constant.

The form of eq. (4) differs from that of reference [7] only by a normalization constant.
An example of three-dimensional transform coding of a 16 frame data base is given in
figure 1. This figure illustrates the decoded images for frames number 1 and 16. In this
example, a three-dimensional Cosine transform was performed on cubic blocks of size
16x16x16 on 16 frames of size 256x256. The average bit rate used was 1.0 bits/pixel.

Frame 1  Frame 16

Figure 1
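
The separability argument can be checked numerically on a small array. The sketch below compares a direct triple sum using the full three-dimensional Fourier kernel with a cascade of one-dimensional transforms (temporal direction, then rows, then columns). The normalization factor is left out of both versions for brevity, and the array contents and function names are arbitrary choices for the example.

```python
import cmath

def dft_1d(seq):
    """Unnormalized one-dimensional DFT of a short sequence."""
    n = len(seq)
    return [sum(seq[x] * cmath.exp(-2j * cmath.pi * u * x / n) for x in range(n))
            for u in range(n)]

def dft_3d_direct(f):
    """Triple sum with the full 3-D kernel; f is indexed f[t][y][x], result F[w][v][u]."""
    n3, n2, n1 = len(f), len(f[0]), len(f[0][0])
    return [[[sum(f[t][y][x]
                  * cmath.exp(-2j * cmath.pi * (u * x / n1 + v * y / n2 + w * t / n3))
                  for t in range(n3) for y in range(n2) for x in range(n1))
              for u in range(n1)] for v in range(n2)] for w in range(n3)]

def dft_3d_separable(f):
    """Same transform as a cascade of 1-D DFTs: temporal, then rows, then columns."""
    n3, n2, n1 = len(f), len(f[0]), len(f[0][0])
    g = [[[complex(f[t][y][x]) for x in range(n1)] for y in range(n2)] for t in range(n3)]
    for y in range(n2):                      # 1-D transform in the temporal direction
        for x in range(n1):
            col = dft_1d([g[t][y][x] for t in range(n3)])
            for w in range(n3):
                g[w][y][x] = col[w]
    for t in range(n3):                      # 1-D transform along each row
        for y in range(n2):
            g[t][y] = dft_1d(g[t][y])
    for t in range(n3):                      # 1-D transform along each column
        for x in range(n1):
            col = dft_1d([g[t][y][x] for y in range(n2)])
            for v in range(n2):
                g[t][v][x] = col[v]
    return g

frames = [[[(3 * t + 2 * y + x) % 5 for x in range(4)] for y in range(3)] for t in range(2)]
direct, cascade = dft_3d_direct(frames), dft_3d_separable(frames)
max_err = max(abs(direct[w][v][u] - cascade[w][v][u])
              for w in range(2) for v in range(3) for u in range(4))
print("largest difference between the two computations:", max_err)
```

The cascade needs only one-dimensional transforms, which is the reason the text reduces the hardware question to the single transform of eq. (3).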


INTERFRAME  DPCM  CODING 

In DPCM systems, linear prediction is used to generate a differential signal which is
quantized and transmitted. At the receiver a similar predictor uses some previously
transmitted values of the quantized differential signal to obtain a facsimile of the
transmitted signal.

Prediction of a picture element is performed by using a set of previously scanned picture
elements

$$\hat{s}_0 = \sum_{i=1}^{n} a_i s_i \tag{5}$$

where $\{s_i\}$ is the set of picture elements with zero mean and variance $\sigma^2$, and
n is the order of the predictor. The predictor parameters $a_j$ are the solutions of n
algebraic equations:

$$R_{0i} = \sum_{j=1}^{n} a_j R_{ij}, \qquad i = 1, 2, \ldots, n \tag{6}$$

where $R_{ij}$ are the correlations between picture elements:

$$R_{ij} = E[s_i s_j]$$

The variance of the differential signal is

$$\sigma_e^2 = \sigma^2 - \sum_{i=1}^{n} a_i R_{0i} \tag{7}$$

In a three-dimensional DPCM interframe coder, the value of the picture element is
estimated using previously scanned picture elements as shown in figure 2, in which $s_0$
is the picture element to be predicted. The elements $s_1$, $s_2$, $s_4$, and $s_5$ are the
previously scanned picture elements in the present frame, and $s_3$, $s_6$, $s_7$, ...,
$s_{12}$, $s_{13}$ are the closest samples on the previous frame.


Figure  2 

Figure 3 shows frames number 2 and 16 of the 16 frame data base after coding by a
three-dimensional third order predictive coder at 2.0 bits/pixel. The picture elements
used for predicting are $s_1$ and $s_2$ in the present frame and $s_3$ on the previous
frame.

Frame 2  Frame 16

Figure 3


Three-dimensional hybrid encoders have been investigated which use two-dimensional
transformations in the spatial domain cascaded with a DPCM encoder in the temporal domain.
Due to DPCM encoding in the temporal direction, these encoders require storage of only the
single previous frame of data. Simulation results indicate that the hybrid
three-dimensional encoders perform better than the corresponding three-dimensional
transform encoders. This observation is confirmed by the comparative performance of the
two-dimensional hybrid and transform encoders previously reported in the literature [8].
The following discusses two different three-dimensional hybrid coding algorithms which
have been investigated.

Two-Dimensional Cosine Transform/DPCM Coder: This hybrid coder exploits spatial image
correlations by taking a two-dimensional discrete Cosine transform and temporal
correlation by use of a DPCM coder. Theoretical studies indicate that this hybrid
interframe coder possesses the attractive features of the hybrid intraframe coder. It is
anticipated that this system will reduce the number of bits needed for reconstruction of
television images at the receiver by a factor of five over the two-dimensional hybrid
coder. Figure 4 shows the results for frames 1 and 16 after applying the hybrid
Cosine-Cosine/DPCM encoder. The average bit rate used was 1.0 bits/pixel.

Frame 1  Frame 16

Figure 4


Two-Dimensional Fourier Transform/DPCM Coder: An alternative approach to the
three-dimensional hybrid coder uses the two-dimensional Fourier transform in place of the
two-dimensional Cosine transform. In this system, the two-dimensional Fourier transform
coefficients of each frame are coded and transmitted. Figure 5 illustrates the results for
frames 1 and 16 due to Fourier-Fourier/DPCM coding of the 16 frame data base. These
results were obtained by coding the real and imaginary components of the Fourier
coefficients at an average bit rate of 0.5 bits for each component.
Frame 1  Frame 16

Figure 5

The two-dimensional Fourier transform of the kth frame, $F_k(u,v)$, can be expressed as

$$F_k(u,v) = A_k(u,v)\,\exp\{i\,\phi_k(u,v)\}$$

where $A_k(u,v)$ and $\phi_k(u,v)$ are the amplitude and phase planes of the kth frame.
Many types of motion, such as panned motion, produce significant changes in the phase
plane and small changes in the amplitude plane. An attractive feature of amplitude-phase
coding is that motion compensation may be implemented directly in the phase plane by
application of appropriate phase correction factors.
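
The statement about panned motion follows from the shift property of the discrete Fourier transform, which a one-dimensional sketch makes concrete. The example assumes an idealized circular shift of a single scan line (real panning also brings new content in at the frame edge), and the sample values and names are made up.

```python
import cmath

def dft(seq):
    """Unnormalized one-dimensional DFT."""
    n = len(seq)
    return [sum(seq[x] * cmath.exp(-2j * cmath.pi * u * x / n) for x in range(n))
            for u in range(n)]

def circular_shift(seq, d):
    """Move every sample d positions to the right, wrapping around the end."""
    n = len(seq)
    return [seq[(x - d) % n] for x in range(n)]

line = [3, 7, 1, 0, 4, 9, 2, 5]        # one scan line of the reference frame
d = 3                                  # assumed pan of 3 pixels
n = len(line)

reference = dft(line)
panned = dft(circular_shift(line, d))

# The amplitude plane is unchanged by the shift ...
amplitude_change = max(abs(abs(p) - abs(r)) for p, r in zip(panned, reference))

# ... and a phase correction factor exp(+2*pi*i*u*d/N) undoes the shift entirely.
corrected = [panned[u] * cmath.exp(2j * cmath.pi * u * d / n) for u in range(n)]
residual = max(abs(c - r) for c, r in zip(corrected, reference))

print("largest amplitude change:", amplitude_change)
print("residual after phase correction:", residual)
```

In this idealized case the motion is carried entirely by a linear phase term, which is why a correction applied in the phase plane can compensate for it.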

SYSTEMS ANALYSIS

In this section, direct comparisons are made between the four previously discussed
interframe encoders. The comparison criteria used are: coding efficiency, storage
requirements, noise immunity and implementation complexity.

The results shown as figures 1, 3, 4, and 5 illustrate the coding performance of the four
encoders. For MSE comparison purposes, the encoders were run at an average bit rate of
1.0 bits/pixel. This bit rate was chosen as it is the lowest bit rate at which the
three-dimensional DPCM encoder can operate. Additional experiments have shown that the
hybrid transform/DPCM encoders can be successfully operated at even lower bit rates. For
each encoder, MSE calculations were made for each frame coded. The MSE values were
normalized relative to total image energy of each frame. Comparison of the normalized MSE
values indicates that the hybrid Cosine-Cosine/DPCM and Fourier-Fourier/DPCM encoder
implementations were superior in terms of MSE coding efficiency. This conclusion is
supported by subjective comparison of the coded frames illustrated above.
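
The normalization described above can be written out as a short function. This is one reading of "normalized relative to total image energy": the summed squared error divided by the summed squared pixel values of the original frame. The function name and the tiny test frames are illustrative.

```python
def nmse(original, decoded):
    """Squared coding error of a frame divided by the total energy of the original frame."""
    error = sum((a - b) ** 2
                for row_o, row_d in zip(original, decoded)
                for a, b in zip(row_o, row_d))
    energy = sum(a * a for row in original for a in row)
    return error / energy

frame = [[10, 12], [11, 13]]          # made-up 2 x 2 "frame"
coded = [[10, 11], [12, 13]]          # made-up decoder output
print("NMSE:", nmse(frame, coded))
```

Dividing by the frame energy makes values comparable between frames of different brightness, which is what allows the per-frame curves for different coders to be placed on one graph.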

A significant disadvantage of all interframe coding systems is the requirement for
storage of previously scanned data frames. Of the interframe systems considered in this
paper, the three-dimensional Cosine implementation is the least attractive in terms of
required storage as several previous frames must be retained. Even if the number of frames
stored is constrained to be [illegible], the total memory requirements are still
[illegible]. This is evident when the three-dimensional Cosine encoder is compared to the
three implementations which use first-order predictive DPCM coding in the temporal domain.
Under the assumption of first-order temporal predictors, only the single previous frame
needs to be stored. A special case occurs for the three-dimensional DPCM implementation
where, in addition to the previous frame, it is also necessary to retain the previously
scanned line of the current frame.

Immunity to channel noise varies widely for the interframe coder implementations
considered. The least sensitive is the three-dimensional transform encoder, whereas the
DPCM encoder is most vulnerable to channel noise due to its transmission of simple pixel
amplitude differences at a low bit rate. The hybrid transform/predictive encoders transmit
differences in transform coefficients instead of pixel amplitudes and are less sensitive
to channel noise than strictly predictive encoders. An aspect of predictive interframe
coders which has yet to be fully investigated is their amenability to use with error
detection and correction algorithms.

The implementation complexity criterion is a coarse measure of weight, size and power
requirements for each encoder. The inherently simple operation of DPCM coders combined
with essentially a single frame of storage favors the DPCM interframe coder. Conversely,
the multiple frame storage requirements of the three-dimensional transform coders severely
limit their usefulness. The two hybrid transform/predictive encoders appear to be the best
compromise as they combine single frame storage requirements with the simplicity of DPCM
in the temporal domain.

A summary of the results of the systems analysis for the interframe encoders is contained
in Table 1.
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APPENDIX  D 


THE  PRIME  COSINE  TRANSFORM 


THE PRIME COSINE TRANSFORM
J.  Speiser 

NAVAL  UNDERSEA  CENTER 
SAN  DIEGO,  CA  92132 


This  note  discusses  a  high  speed  implementation  of  the  odd  discrete  cosine 
transform  (ODCT)  which  eliminates  the  multipliers  required  in  earlier  imple¬ 
mentations  [1]  based  on  the  chirp-Z  transform.  The  discrete  cosine  transform 
is  useful  for  television  data  compression  since  its  basis  vectors  closely  approx¬ 
imate  those  of  the  optimum  Karhunen-Loeve  transform  for  exponentially  corre¬ 
lated  data  [2]. 

The ODCT is defined as the first N Fourier coefficients of the length 2N-1
even extension of the data, assuming that the data consists of N real values.
This is shown in eqn (1).

$$G_k = \sum_{n=-(N-1)}^{N-1} g_n\, e^{-i2\pi kn/(2N-1)} \qquad \text{for } k = 0, 1, \ldots, N-1 \tag{1}$$

where $g_{-n} = g_n$.

In  order  to  be  able  to  use  a  variant  of  Rader's  Prime  Transform  algorithm 
[3]  we  assume  that  P  =  2N-1  is  a  prime.  Note  that  the  data  block  length,  N, 
need  not  be  a  prime,  as  shown  in  the  Table. 

It has been previously shown that if P is a prime, the discrete Fourier
transform (DFT) of length P can be implemented using a circular convolution of
length P-1 together with two analog permuter memories of length P [4]. It will
be shown here that the symmetry of the extended data in equation (1) permits
the size of the circular convolution and the permuter memories to be reduced by
a factor of two.


For each prime P, there is an integer R, called a primitive root of P,
such that the residues of R, R^2, R^3, ... are all distinct modulo P, and include
every nonzero residue modulo P [5]. Therefore, for each integer n not congruent
to zero mod P, n can be represented uniquely as a power of R modulo P, say
$n \equiv R^{n'} \pmod{P}$. The integer n' is called the index of n (mod P) with respect
to the primitive root R. In effect, R plays the role of the base of a system
of logarithms in modulo P arithmetic, and n' is the logarithm of n. This
representation is useful because it allows us to replace multiplication by addition
in the exponent of the DFT, and thus reduce the DFT to a circular correlation,
as shown in equation (2).

$$G_{R^{k'}} = g_0 + \sum_{\substack{n=-(N-1) \\ n \neq 0}}^{N-1} g_{R^{n'}}\, e^{-i2\pi R^{n'+k'}/(2N-1)} \tag{2}$$


Since zero does not have an index (logarithm) with respect to the primitive
root, the zero frequency point in the transform must be computed separately,
as shown in equation (3).

$$G_0 = \sum_{n=-(N-1)}^{N-1} g_n = g_0 + 2\sum_{n=1}^{N-1} g_n \tag{3}$$

The interpretation of equation (2) is that the DFT coefficients in permuted
order are obtained by adding $g_0$ to the circular crosscorrelation of a
permuted sinusoid with a permuted version of the data points excluding $g_0$.
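
The interpretation above can be checked numerically for a general (not necessarily symmetric) data block of prime length. The sketch below indexes the data from 0 to P-1 rather than from -(N-1) to N-1, which is equivalent modulo P; the function names and test values are assumptions for the example, and a software loop stands in for the transversal filter.

```python
import cmath

def direct_dft(g):
    """Reference DFT computed straight from the definition."""
    P = len(g)
    return [sum(g[n] * cmath.exp(-2j * cmath.pi * k * n / P) for n in range(P))
            for k in range(P)]

def prime_dft(g, R):
    """DFT of prime length P = len(g) via permutation by powers of a primitive root R."""
    P = len(g)
    powers = [pow(R, e, P) for e in range(P - 1)]        # R^0, R^1, ..., R^(P-2) mod P
    assert sorted(powers) == list(range(1, P)), "R is not a primitive root of P"
    data = [g[r] for r in powers]                        # permuted data, g_0 excluded
    sinusoid = [cmath.exp(-2j * cmath.pi * r / P) for r in powers]  # permuted sinusoid
    G = [0j] * P
    G[0] = sum(g)                                        # zero frequency point, done separately
    for k_idx in range(P - 1):
        # circular crosscorrelation of the two permuted sequences at lag k_idx
        corr = sum(data[n_idx] * sinusoid[(n_idx + k_idx) % (P - 1)]
                   for n_idx in range(P - 1))
        G[powers[k_idx]] = g[0] + corr                   # output arrives in permuted order
    return G

g = [1, 2, 0, -1, 3, 5, 4]             # P = 7; 3 is a primitive root of 7
worst = max(abs(a - b) for a, b in zip(prime_dft(g, 3), direct_dft(g)))
print("largest deviation from the direct DFT:", worst)
```

The index arithmetic works because R^(P-1) is congruent to 1 modulo P, so adding indices modulo P-1 corresponds to multiplying residues modulo P.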

For the special case of the DCT, the symmetry of the extended data allows us
to replace the complex exponential by a cosine, as shown in equation (4).

$$G_{R^{k'}} = g_0 + \sum_{\substack{n=-(N-1) \\ n \neq 0}}^{N-1} g_{R^{n'}}\, \cos\left(2\pi R^{k'+n'}/(2N-1)\right) \tag{4}$$

It will now be shown that the permuted data and the permuted cosine have
periodicity N-1, so that the circular correlation of length P-1 = 2N-2 can be
reduced to a circular correlation of length N-1. First, note that both the
extended data and the cosine function are even. Let h be any even sequence.
It will be shown that $h_{R^s}$ has period N-1 in s, where the subscript is reduced
modulo P = 2N-1.

It is well known in number theory that $R^{(P-1)/2} \equiv -1 \pmod{P}$ [5]. In our
case, (P-1)/2 = (2N-2)/2 = N-1. Therefore
$h_{R^{s+(N-1)}} = h_{R^s R^{N-1}} = h_{-R^s} = h_{R^s}$.

Using  this  periodicity  property  applied  to  equation  (4)  lets  us  write  the 
ODCT  in  shorter  form,  as  shown  in  equation  (5). 
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(5A) 
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(5B) 


The circular correlation required for equation 5A may be implemented by
any of the alternative methods for implementing a circular correlation. The
most straightforward would be to use a transversal filter of length 2(N-1)-1 =
2N-3, with tap weights of $\cos(2\pi R^s/(2N-1))$, for s = N-1, ..., 1, 0, 1, 2, ..., N-1.

The  architecture  of  the  transform  is  shown  in  Figure  1,  and  is  virtually 
identical  to  that  of  the  prime  Fourier  transform,  except  that  the  prime  cosine 
transform  need  only  permute  real  data  and  filter  the  permuted  real  data  with  a 
filter  having  real  weights.  The  prime  cosine  transform  is  thus  considerably 
simpler  to  implement  than  a  prime  Fourier  transform  of  the  same  block  length. 
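
A software sketch can confirm that a circular correlation of length N-1 with real cosine weights reproduces the ODCT of equation (1). The folding of index r onto min(r, P-r) and the factor of two on the correlation are this sketch's reading of the symmetry argument, not a transcription of the shortened form of the transform; names and data values are likewise assumptions.

```python
import math

def odct_direct(g):
    """ODCT from its definition: DFT of the even extension, written with cosines."""
    N = len(g)
    P = 2 * N - 1
    return [g[0] + 2 * sum(g[n] * math.cos(2 * math.pi * k * n / P) for n in range(1, N))
            for k in range(N)]

def odct_prime(g, R):
    """ODCT of N real samples via a length N-1 circular correlation (P = 2N-1 prime)."""
    N = len(g)
    P = 2 * N - 1
    half = N - 1
    assert sorted(pow(R, e, P) for e in range(P - 1)) == list(range(1, P)), \
        "R is not a primitive root of P"

    def fold(r):
        # Even symmetry: residue r and residue P - r (that is, -r) name the same sample.
        return min(r, P - r)

    powers = [pow(R, s, P) for s in range(half)]          # first half-period of R^s mod P
    data = [g[fold(r)] for r in powers]                   # permuted real data
    taps = [math.cos(2 * math.pi * r / P) for r in powers]  # permuted real cosine weights
    G = [0.0] * N
    G[0] = g[0] + 2 * sum(g[1:])                          # zero frequency point
    for k in range(half):
        corr = sum(data[s] * taps[(s + k) % half] for s in range(half))
        G[fold(powers[k])] = g[0] + 2 * corr              # un-permute the output
    return G

g = [4.0, 1.0, -2.0, 3.0, 0.5, 2.5, -1.0, 0.0, 1.5, 2.0, -0.5, 3.5, 1.0, 0.0, 2.0, -3.0]
worst = max(abs(a - b) for a, b in zip(odct_prime(g, 3), odct_direct(g)))
print("N = 16, P = 31, R = 3: largest deviation from the direct ODCT:", worst)
```

Only real data and real weights appear, which matches the observation that the prime cosine transform is simpler to implement than a prime Fourier transform of the same block length.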
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A  suitable  analog  permuter  memory  has  been  developed  for  the  Naval  Undersea 
Center  as  a  minor  modification  of  a  commercially  available  serial  analog  memory*. 
The  commercially  available  serial  access  memory  stores  analog  samples  as  charges 
in  an  array  of  MOS  capacitors  under  the  control  of  read  in  and  read  out  shift 
registers.  The  permuter  memory  shown  in  Figure  2  differs  only  in  the  fact  that 
one  of  the  shift  registers  has  been  replaced  by  a  binary  decoder,  thus  allow¬ 
ing  the  data  to  be  reordered  by  an  external  control  signal. 


*Reticon  Corp.  Sunnyvale,  CA. 
P (prime)    N = (P+1)/2 (data block length)    2N-3 = P-2 (filter length)

   31                     16                               29
   61                     31                               59
  127                     64                              125
  251                    126                              249
  257                    129                              255

Table 1. Selected primes and the corresponding ODCT lengths
and filter lengths. (The filter lengths shown
assume that the data is not recirculated
or reread into the filter.)
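
The entries of Table 1 follow directly from the two formulas in its column headings, as the short sketch below shows. The helper names are illustrative.

```python
def is_prime(p):
    """Trial division; adequate for the small primes considered here."""
    if p < 2:
        return False
    i = 2
    while i * i <= p:
        if p % i == 0:
            return False
        i += 1
    return True

def odct_sizes(p):
    """Return (P, data block length N, filter length 2N-3) for an odd prime P."""
    assert is_prime(p) and p % 2 == 1, "P must be an odd prime"
    n = (p + 1) // 2
    return p, n, 2 * n - 3

for p in (31, 61, 127, 251, 257):
    print("P = %3d   N = %3d   filter length = %3d" % odct_sizes(p))
```

The row for P = 31 shows that a block length that is a power of two, N = 16, is available even though N itself is not prime.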


[Block diagram. Blocks shown: sample and hold (hold for N time units); integrator
(integrate for N-1 time units); permuter; filter; permuter. The switch is in the up
position for the first data sample, and is down for the remaining N-1 samples.]

Figure 1. Prime ODCT architecture.
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FIGURE  2 

ANALOG  PERMUTATION  MEMORY 
PRIME TRANSFORM SAW DEVICE


J. M. Alsup

Naval Undersea Center
San Diego, CA 92132

ABSTRACT: A method of calculating the discrete Fourier transform through the prime
transform algorithm with surface acoustic wave devices is presented. The method is similar
to the chirp-Z transform (CZT) technique [4], and utilizes the SAW device as a transversal
filter. The prime transform is based on an algorithm which uses as indices the set of
integers generated by the series R^k modulo N (k = 1, 2, ..., N-1). N is prime and R is an
integer whose special property relative to N is that its successive powers modulo N are
distinct, and therefore form a permutation of the integers 1, 2, ..., N-1. The SAW prime
transform implementation has the same processing speed advantage as the SAW CZT
implementation, namely, that it computes a discrete Fourier transform with speed
commensurate to a fully pipelined FFT running at the same sample rate. The attributes of
small size, light weight, and interconnection simplicity are also maintained.


The Prime Transform

The discrete Fourier transform (DFT) of a sampled data sequence has special properties
when the number of points to be transformed is prime [1]. For each prime number N there
exist integers, known as "primitive roots," whose successive integer powers modulo N will
generate a permuted version of the sequence 1, 2, ..., N-1 [2]. The DFT (1) can then be
written in terms of the permuted integer sequence for non-zero values of the time and
frequency indices, so that (2) results.

$$G_m = \sum_{n=0}^{N-1} g_n \exp(-j2\pi nm/N) \tag{1}$$

$$G_{R^k} = g_0 + \sum_{\ell=1}^{N-1} g_{R^\ell} \exp(-j2\pi R^{k+\ell}/N) \tag{2}$$
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where 


$m = R^k \bmod N, \quad k = 0, 1, \ldots, N-2$

and

$n = R^\ell \bmod N, \quad \ell = 1, 2, \ldots, N-1.$

Direct permutation of sampled analog data can be achieved, for example, with a modified
version of a commercially available device.¹ The auxiliary computation to account for the
contribution of the zeroth sample can be carried out in a number of different ways, as are
shown in Figure 2.

Figure 2. Four Ways to Include Auxiliary Computation for Zeroth-Sample Input


Equation (2) represents a circular convolution with auxiliary operations, and as such is
suitable for implementation by transversal filter architectures [3]. The filter tap
weights are given by the expression $\exp(-j2\pi R^\ell/N)$, and are permuted values of a
complex sinusoid. The sequence $g_{R^\ell}$ is a permuted version of the input sample
values with the first sample deleted. The transform output coefficients $G_{R^k}$ are the
m = 1 to N-1 Fourier coefficients in permuted order, and $G_0$ is computed separately. A
comparison of the basic prime transform implementation with that of the CZT is shown in
Figure 1 (also see [4,5]).
For a given prime N, there exists a selection of integers between
0 and N which qualify as primitive roots. The number of such roots is
given by the Euler φ-function, φ(N-1). For example, there are 8 primitive
roots associated with N = 31: 3, 11, 12, 13, 17, 21, 22, and 24.
Thus, there are φ(N-1) different ways to fabricate a prime transform
device of length N. The variety of related pulse compression codes is
discussed in [7]. Primitive roots for various primes are listed in [8].
Equation (3) illustrates the Euler φ-function:


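$$\phi(A) = A \cdot \frac{a-1}{a} \cdot \frac{b-1}{b} \cdot \frac{c-1}{c} \cdots \tag{3}$$

where $A = a^{\alpha} b^{\beta} c^{\gamma} \cdots$, and a, b, c, ... are
distinct primes, and α, β, γ, ... are integers.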
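As a check on the count of primitive roots, the following short sketch evaluates the Euler φ-function from the prime factorization and lists the primitive roots of a prime by brute force. The helper names are hypothetical and the trial-division factoring is only suitable for small arguments.

```python
def prime_factors(a):
    """Set of distinct prime factors of a (trial division)."""
    factors = set()
    p = 2
    while p * p <= a:
        while a % p == 0:
            factors.add(p)
            a //= p
        p += 1
    if a > 1:
        factors.add(a)
    return factors


def euler_phi(a):
    """Euler phi-function: a times the product of (p - 1)/p over primes p dividing a."""
    result = a
    for p in prime_factors(a):
        result = result // p * (p - 1)
    return result


def primitive_roots(n):
    """Primitive roots of a prime n: r whose powers generate 1, ..., n - 1."""
    qs = prime_factors(n - 1)
    return [r for r in range(1, n)
            if all(pow(r, (n - 1) // q, n) != 1 for q in qs)]
```

With n = 31, `euler_phi(30)` gives the number of primitive roots and `primitive_roots(31)` lists them, which can be compared with the eight values quoted in the text.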


†Reticon Corp. Model SAM (serial analog memory).


*To be published in Symposium Proceedings, IEEE (1975).
SAW Implementation


A surface acoustic wave (SAW) device was designed and constructed
at NUC to verify the feasibility of the prime transform so
implemented. The prime number was chosen to be 31, and the primitive
root 3. This led to the selection of the tap weights tabulated in
Table 1. Since circular convolution was required, two periods less one
sample of the 30-tap sequence were incorporated into the tapping
structure, resulting in a total of 59 complex taps. The complex
arithmetic was implemented by using real and imaginary parts of the
specified tap response to determine amplitude weightings on two parallel
acoustic paths driven by a common acoustic input signal.


Table 1. Prime Transform Tap Weights, N = 31, R = 3.
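The tap sequence and the reason for using 59 taps can be illustrated numerically. The sketch below is an assumption-laden illustration rather than the device layout: it takes the tap weights as exp(-j2πR^k/N) for k = 0, ..., 29 and shows that a linear (non-recirculating) filter holding two periods less one sample produces every output of the 30-point circular convolution. The ordering of taps along the physical device is not specified here.

```python
import cmath

N, R = 31, 3
P = N - 1  # 30 taps per period

# One period of complex tap weights: real part on one path, imaginary on the other.
period = [cmath.exp(-2j * cmath.pi * pow(R, k, N) / N) for k in range(P)]

# Two periods less one sample: 2 * 30 - 1 = 59 complex taps.
taps = (period + period)[:2 * P - 1]


def circular_convolve_with_taps(x):
    """Circular convolution of a 30-sample block x with `period`,
    obtained from the 59-tap linear filter at full-overlap output times."""
    c = [0j] * P
    for j in range(P - 1, 2 * P - 1):   # the 30 outputs with full overlap
        c[j % P] = sum(x[l] * taps[j - l] for l in range(P))
    return c
```

Only the 30 output samples taken while the whole input block lies under the taps are used, which is consistent with a single device being busy for roughly twice the block length per transform.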
The entire electrode structure and a closeup are shown in Figures
3 and 4. The device was built using AT-cut quartz as the substrate,
with tap spacings of .0256 inch, equivalent to 0.2 μsec. Each
tap consists of 8 finger-pairs with wavelength .0016 inch, equivalent
to 80 MHz. The taps are spaced with a gap equal to their width, and
the entire array of taps with one input transducer is 1.88 inches in
length. The aperture of the taps varies according to the specified
weighting function, and has a maximum of 0.256 inch. The aperture
of the input transducer at either end of the device is 0.596 inch. The
interconnect busses were configured so that either all 59 or just 30
taps (from either end) could be connected together to form the real
and imaginary output signal components.


Figure 3. Prime Transform SAW Electrode Structure


Figure 4. Prime Transform SAW Electrode Structure, Closeup


A separate uniformly weighted tapping structure (not shown)
was designed to carry out the zeroth-sample auxiliary calculation, but
was not used in initial device tests. The resulting device is capable of
calculating a 31-point DFT in 6.2 μsec, and can operate at duty cycles
up to 50%. Two such devices can be used alternately to achieve 100%
duty cycle when required.
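The quoted device figures are mutually consistent, as the following arithmetic sketch shows. It assumes a 0.2 μsec delay per tap (the reciprocal of the 5 MHz sample rate given in the Conclusions) and uses the tap spacing and finger wavelength stated above; the variable names are illustrative only.

```python
tap_spacing_in = 0.0256     # inch, tap-to-tap spacing
sample_period_us = 0.2      # microseconds per tap (assumed: 1 / 5 MHz)
wavelength_in = 0.0016      # inch, finger-pair wavelength
N = 31                      # transform length

# Surface wave velocity implied by the tap spacing and tap delay.
velocity_in_per_us = tap_spacing_in / sample_period_us

# Carrier frequency implied by the finger wavelength (MHz = cycles per microsecond).
carrier_mhz = velocity_in_per_us / wavelength_in

sample_rate_mhz = 1.0 / sample_period_us
transform_time_us = N * sample_period_us

print(round(velocity_in_per_us, 4), round(carrier_mhz, 3),
      round(sample_rate_mhz, 3), round(transform_time_us, 3))
```

The implied velocity of 0.128 inch/μsec is about 3.25 mm/μsec, the carrier works out to 80 MHz, and 31 samples at 0.2 μsec each give the 6.2 μsec transform time.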

Experimental  Results 

The procedure used to view the impulse response of the SAW
prime transform device is illustrated in Figure 5. A quarter-wave delay
(about 3.1 ns) was used to enable quadrature combination of the real
and imaginary parts of the response, and the separate and combined
signals are shown in Figures 6 and 7. This technique for achieving
discrete phase modulation of a signal carrier is discussed in [6], and
could have just as easily been incorporated into the initial mask
design instead of being left to electrical manipulation external to the
SAW device as performed here. Lack of perfection in implementation
of the specified tap weights due to missing or broken fingers will
show up as a nonuniformity in the complex output sample magnitude,
and Figure 7 does show this to some extent.
Figure 7. Impulse Response, Taps 30-1, Complex Combination


Figure 9. SAW Prime Transform Outputs at Carrier (80 MHz): Upper,
Input Frequency = 1 cycle; Lower, Input Frequency =
30 cycles.


Figure 10. SAW Prime Transform Outputs, Magnitude Squared, at
Baseband: Upper, Input Frequency = 1 cycle; Lower,
Input Frequency = 30 cycles.

Conclusions 

The prime transform algorithm implemented with SAW transversal
filters has been shown to be a powerful tool for the high-speed
evaluation of the discrete Fourier transform. At a 5 MHz sample rate,
a 31-point DFT can be calculated in 6.2 μsec, which represents a
substantial increase in speed over conventional FFT techniques. The
transversal filter prime transform is very similar to the CZT method,
and represents an alternative to it when permutation of the data being
transformed is preferred over multiplication of the data. The prime
transform requires auxiliary calculations to account for the
contribution of the zeroth data sample and to evaluate the dc transform
coefficient, but either or both of these may be dispensed with if the
application does not depend critically upon them. Inverse permutation
of the filter output may also be eliminated if not needed.

The same limitations with regard to accuracy apply to analog
implementations of the prime transform algorithm such as this one as
apply to CZT analog implementations; the limit is about the
equivalent of 7 or 8 bits. Like the CZT filter, the transversal filter
prime transform implementation affords the possibility of calculating
both forward and inverse transforms using identical modules, and
this in turn leads to the capability of real-time high-data-rate linear
signal processing tools such as cross-convolvers, cross-correlators,
adaptive filters, and programmable matched filters.
APPENDIX F

MODULAR  IMPLEMENTATIONS 
OF  THE  DISCRETE  COSINE  TRANSFORM 
J.M. Speiser
Naval Undersea Center

ABSTRACT

A  modular  architecture  is  described  for  the  implementation  of  the  Even 
Discrete  Cosine  Transform  (EDCT).  This  architecture  permits  the  use  of  four  EDCT 
modules  to  compute  a  double  length  transform  with  twice  the  throughput  rate  of 
the  individual  modules. 

INTRODUCTION 

The  utility  of  the  EDCT  for  data  compression  has  been  previously 
described  [1-3].  Several  different  serial  access  implementations  for  its  high 
speed  computation  have  also  been  described  [4].  Implementations  using  charge 
coupled  device  (CCD)  transversal  filters  are  particularly  attractive  [2]  for 
applications  which  require  low  weight,  small  size,  low  power  consumption,  and 
controllable clocking of the computation. Present CCD transversal filters perform
well at shift rates of up to about 5 x 10^6 samples per second. This is a factor
of  two  too  slow  to  handle  conventional  television  signals  using  the  previously 
described  EDCT  architectures.  This  note  describes  a  subdivision  of  the  computation 
tasks  to  permit  greater  parallelism  in  the  hardware  to  increase  both  the  throughput 
and  transform  size  implementable  with  a  fixed  set  of  transversal  filter  and  chirp 
read-only  memory  modules. 

DCT  IMPLEMENTATIONS 

A  system  to  compute  the  EDCT  of  length  N,  implementing  previously 
given  equations  [4]  is  shown  in  Fig.  1.  It  differs  from  serial  access  implementations 
of  the  ODCT  [2]  primarily  in  the  sinusoidal  multiplication  following  the  chirp 
postmultiplication.  Despite  this  slight  complication  the  EDCT  was  selected  for 
modular  decomposition  rather  than  the  ODCT  because  an  odd  length  extension  of  a 
data block can only be subdivided into an even length data block and an odd length
data block, while an even length extension of a data block can be subdivided into
two data blocks of the same length in order to permit simultaneous computation
by  similar  modules. 

Since intermediate complex quantities need to be preserved in the modular
decomposition, the complex extended discrete Fourier transform (EDFT) portion of
Fig. 1 was chosen as the basic module. This module computes N points of a length
2N DFT of the extension of the data sequence by N zeroes. The interconnection of
modules with minor auxiliary components to perform an EDCT with doubled throughput
on a double length data block is shown in Fig. 2.


DERIVATION  OF  THE  MODULAR  EDCT 

Let the input data be denoted by g_m for m = 0, 1, ..., M-1. Define the
symmetrized extension of the data by g_{-1-m} = g_m for m = 0, 1, ..., M-1. The even
discrete cosine transform (EDCT) of g may then be defined by any of the equivalent
expressions shown in equations 1-3.
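The relationship between the symmetrized extension, the zero-extended DFT module, and the cosine sum can be checked numerically. The sketch below is illustrative only: it assumes an unnormalized EDCT of the form 2 Σ g_m cos(π(2m+1)k/2M), which may differ from the report's equations 1-3 by a scale factor, and the function names are hypothetical.

```python
import cmath
import math


def edct_direct(g):
    """Assumed EDCT: 2 * sum_m g[m] * cos(pi * (2m + 1) * k / (2M))."""
    M = len(g)
    return [2 * sum(g[m] * math.cos(math.pi * (2 * m + 1) * k / (2 * M))
                    for m in range(M)) for k in range(M)]


def edft(g):
    """First M points of the length-2M DFT of g extended by M zeroes."""
    M = len(g)
    return [sum(g[m] * cmath.exp(-2j * cmath.pi * m * k / (2 * M))
                for m in range(M)) for k in range(M)]


def edct_via_edft(g):
    """EDCT from the EDFT module followed by a phase (sinusoidal) multiplication."""
    M = len(g)
    Z = edft(g)
    return [2 * (cmath.exp(-1j * cmath.pi * k / (2 * M)) * Z[k]).real
            for k in range(M)]


def edct_via_extension(g):
    """EDCT from the length-2M DFT of the symmetrized extension g[-1-m] = g[m]."""
    M = len(g)
    out = []
    for k in range(M):
        s = sum((g[m] if m >= 0 else g[-1 - m])
                * cmath.exp(-2j * cmath.pi * m * k / (2 * M))
                for m in range(-M, M))
        out.append((cmath.exp(-1j * cmath.pi * k / (2 * M)) * s).real)
    return out
```

All three routes should agree to within rounding error for real input data, which is why a complex EDFT module followed by a sinusoidal multiplication suffices to produce the real EDCT coefficients.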
Let the data block length be even, say M=2N. Then the summation in
the DCT can be split into shorter sums as shown in equations 4-5.
Each of the summations in equation 5 may be interpreted as a DFT of length
2N of a length N data block extended by N zeroes. The first N coefficients of such
a DFT are computed directly by the EDFT module of Fig. 1. The remaining coefficients
may all be written in the form G_{N+p}, where p = 0, 1, ..., N-1. The corresponding
terms of equation 5 are examined in equations 6-9.

The structure shown in Figure 2 uses equation 4 to generate the
coefficients G_0, G_1, ..., G_{N-1} and uses equations 6-9 to generate
G_N, ..., G_{2N-1}.
Fig. 1. System to Perform an EDCT of Length N, Including Module to Perform a
DFT of Size 2N of a Data Block of Size N Extended by N Zeros (EDFT). The
transversal filter in the module has 2N - 1 taps.


APPENDIX  G 

SIGNAL  PROCESSING  ARCHITECTURES 
USING  TRANSVERSAL  FILTER  TECHNOLOGY 
SIGNAL PROCESSING ARCHITECTURES USING
TRANSVERSAL FILTER TECHNOLOGY*

H. J. Whitehouse, R. W. Means, and J. M. Speiser
Naval Undersea Center



INTRODUCTION 

A large portion of the computational load for many signal processing problems consists of the
computation of linear transforms. For time-invariant linear transforms such as cross convolution or matched
filtering, the transversal filter provides a highly parallel computational module with high throughput and
minimal control overhead [1]. This paper will show how similar computational modules can be configured
to provide similar computational advantages for a large class of time-variant linear transforms including
one-dimensional and multi-dimensional discrete Fourier transforms and one-dimensional and
two-dimensional discrete cosine transforms. Furthermore, time-variant transform modules may be combined
to implement high capacity time-invariant linear transforms. The implementation of these techniques using
surface acoustic wave (SAW) and charge coupled device (CCD) technology permits the real-time solution of
several important signal processing problems including image data compression, wideband radar signal
analysis and spread spectrum communications.




COMPUTATIONAL  MODULES 

A linear transform on sampled data of finite extent may be viewed as the multiplication of a
vector by a matrix. Multiplication by diagonal, circulant, or Toeplitz matrices may be accomplished rapidly
with simple computational hardware modules. Multiplication by an N X N diagonal matrix requires only
a scalar multiplier and a memory containing N values to provide serial access to the reference function.
Multiplication by an N X N Toeplitz matrix corresponds to a convolution and may be performed using
a transversal filter having 2N-1 taps. Multiplication by an N X N circulant is a special case of
multiplication by an N X N Toeplitz matrix in which the length of the transversal filter may be reduced to
N taps if the data block is recirculated through the filter or reread into the filter from a buffer memory.
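The correspondence between these matrix products and transversal filtering can be sketched as follows. This is a minimal illustration with hypothetical function names: the Toeplitz product is read from the middle N outputs of a linear convolution with 2N-1 taps, and the circulant product from an N-tap filter into which the data block is read twice.

```python
def linear_convolve(h, x):
    """Output of a transversal filter with tap weights h driven by samples x."""
    y = [0] * (len(h) + len(x) - 1)
    for i, hi in enumerate(h):
        for j, xj in enumerate(x):
            y[i + j] += hi * xj
    return y


def toeplitz_matvec(t, x):
    """Product T x where T[n][m] = t[(n - m) + N - 1], using 2N - 1 taps."""
    N = len(x)
    assert len(t) == 2 * N - 1
    y = linear_convolve(t, x)
    return y[N - 1:2 * N - 1]           # outputs while the block fully overlaps


def circulant_matvec(c, x):
    """Product C x where C[n][m] = c[(n - m) mod N], using only N taps."""
    N = len(x)
    assert len(c) == N
    y = linear_convolve(c, x + x)       # data block reread into the filter
    return y[N:2 * N]


def diagonal_matvec(d, x):
    """Product D x for diagonal D: one scalar multiplier and a stored reference."""
    return [di * xi for di, xi in zip(d, x)]
```

The diagonal case is the time-varying (pointwise) operation, while the Toeplitz and circulant cases are the time-invariant filtering operations referred to throughout the paper.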


ONE-DIMENSIONAL  DFT 

Linear filters have been used for many years for the calculation of the power spectra of continuous
signals. One of the earliest methods used a bank of wave filters to measure the spectra in fractional octave
bands for telephone network equalization [2]. However, when increased resolution was required the
number of filters rapidly became unmanageable. An alternative which overcame the difficulty of a large
number of filters each with small time-bandwidth product was to substitute one linear fm (chirp) filter with
large time-bandwidth product and to employ matched filtering. In this system the signal to be analyzed
is used to single sideband (SSB) modulate a locally generated chirp signal and the composite modulated
signal is filtered in a chirp delay line matched filter. Each component of the input signal spectrum shifts
the locally generated chirp to a different position in the spectrum after SSB modulation and these shifted
chirps then correlate with the reference signal represented as the impulse response of the matched filter at
different times. Thus the output signal amplitude-time history reflects the amplitude-frequency
composition of the input signal.


*To be published in Proceedings of the 1975 IEEE International Symposium on Circuits and Systems (Boston, 21-23 April, 1975).




Bluestein [3] recognized that the discrete Fourier transform (DFT) of sampled data was amenable
to a similar interpretation. In addition to just calculating the magnitude of the Fourier transform, linear
filters could calculate the phase and thus all of the operations such as cross convolution and cross
correlation could be calculated. This technique came to be called the chirp-Z transform (CZT) and can be applied
to other problems besides just the calculation of the DFT [4]. Prior to these developments, digital
computation of the DFT had been significantly improved by the use of a special algorithm called the fast Fourier
transform (FFT) which was described by Cooley and Tukey [5]. The FFT algorithm gained rapid
popularity in signal processing since it allowed the calculation of the DFT to be done using significantly fewer
machine operations (multiplications) than direct evaluation.

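The DFT can be defined with many normalizations just as the continuous Fourier transform can
be. For this paper the definition of the direct or forward transform of a complex vector g of length N
is given as

$$G_m = \sum_{n=0}^{N-1} g_n \exp(-j2\pi nm/N), \quad \text{i.e. } G = Fg \tag{1}$$

and the inverse transform as

$$g_m = \frac{1}{N} \sum_{n=0}^{N-1} G_n \exp(j2\pi nm/N) \tag{2}$$

where F is an N X N matrix with elements $F_{nm} = \exp(-j2\pi nm/N)$.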

By direct inspection it is observed that, if symmetries of the function exp(-j2πnm/N) are not
exploited, then the number of complex multiplications required will be N^2, corresponding to N
multiplications for each frequency component evaluated. Even on high speed digital computers this can become
the limiting consideration in signal processing applications. The advantage of the FFT algorithm is that
for highly composite values of the DFT size N the number of multiplications is proportional to N log_2 N.

Although the FFT has been successful in substantially reducing the computing time and cost of
using general purpose digital computers it has several disadvantages for special purpose real time
computation. At high throughput rates which are required for real time image processing the processor either must
operate log_2 N times faster than the data rate or pipeline structures which use distributed memory and
log_2 N multipliers must be used. In addition, the internal arithmetic of the FFT processor must be done at
increased precision in order to compensate for the multiple round-off errors introduced by the successive
stages in the FFT processor. Although these difficulties can be overcome, it is not always possible to
arrange the computation in a form where the size of the transform is highly composite. For the above
reasons and because of the difficulty of obtaining small, low power, fast analog to digital converters, linear
transversal filter implementations of the chirp-Z transform are attractive [6] rather than the previous CZT
implementation which used an FFT to perform the required convolution.

The DFT may be easily reduced to the form suitable for linear filtering by the substitution

$$2nm = n^2 + m^2 - (n-m)^2 \tag{3}$$

which changes a product of variables into a difference so that

$$G_m = e^{-j\pi m^2/N} \sum_{n=0}^{N-1} e^{j\pi (n-m)^2/N} \, e^{-j\pi n^2/N} \, g_n. \tag{4}$$
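Equation (4) can be verified numerically with a direct transcription into premultiplication, convolution, and postmultiplication. The sketch below is a software check only, not a model of the analog hardware, and the function names are hypothetical.

```python
import cmath


def dft_direct(g):
    """Direct DFT, N multiplications per output coefficient."""
    N = len(g)
    return [sum(g[n] * cmath.exp(-2j * cmath.pi * n * m / N) for n in range(N))
            for m in range(N)]


def czt_dft(g):
    """DFT computed as chirp premultiply, chirp convolution, chirp postmultiply."""
    N = len(g)
    chirp = [cmath.exp(-1j * cmath.pi * n * n / N) for n in range(N)]
    pre = [g[n] * chirp[n] for n in range(N)]            # premultiplication
    # Convolution with the conjugate chirp; lags run over -(N-1), ..., (N-1).
    conv = [sum(cmath.exp(1j * cmath.pi * (m - n) ** 2 / N) * pre[n]
                for n in range(N)) for m in range(N)]
    return [chirp[m] * conv[m] for m in range(N)]        # postmultiplication
```

Because the convolution kernel depends only on the lag m - n, the middle step is exactly the Toeplitz (transversal filter) operation, and only the two pointwise chirp multiplications are time-varying.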
This form is seen to be equivalent to factoring the Fourier matrix F into the product of three matrices

$$F = DTD \tag{5}$$

where D is a diagonal matrix with elements $d_{nn} = \exp(-j\pi n^2/N)$ and T is a Toeplitz matrix with
elements $T_{nm} = \exp(j\pi (n-m)^2/N)$.

The CZT algorithm is easily implemented by transversal filter techniques. In this case the DFT
is computed by premultiplication by a discrete chirp, convolution with a discrete chirp, and
postmultiplication by a discrete chirp. Figure 1 shows this configuration. However, it must be remembered that both
the multiplications and convolutions are complex and a suitable representation of the complex numbers
must be used. One representation is by real and imaginary part. Figure 2 shows the DFT organized as a
CZT and implemented with parallel computation of the real and imaginary parts. In Figure 2 the input
signal is represented as g = g_R + jg_I and the output signal is represented as G = G_R + jG_I, where it is
understood that g = g_n, n = 0, ..., N - 1 and G = G_m, m = 0, ..., N - 1.

In order to determine the specific form of the transversal filters it is necessary to know the specific
value of N. When N is odd the Toeplitz matrix T may be represented as a transversal filter with 2N - 1
complex taps h_{-(N-1)} to h_{N-1}, where h_n = W^{-n^2/2}, n = -(N-1) to (N-1), and W = exp(-j2π/N). The required
convolution has been implemented with the general transversal filter shown in Figure 3.

When N is even, it can be shown that T_{nm} = h_{n-m} with h_{n+N} = h_n, where the subscripts are
reduced mod N. Thus T is a circulant matrix and can be implemented with a recirculating transversal
filter as shown in Figure 4, where the number of complex taps is N and the tap weights are
h_n = W^{-n^2/2}, n = 0, ..., N - 1.
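The distinction between odd and even N can be checked directly from the chirp tap weights. The sketch below assumes taps of the form h_n = exp(jπn²/N), which equals W^{-n²/2} for W = exp(-j2π/N), and tests whether the sequence repeats with period N; the function names are hypothetical.

```python
import cmath


def chirp_tap(n, N):
    """Chirp tap weight h_n = exp(j * pi * n^2 / N)."""
    return cmath.exp(1j * cmath.pi * n * n / N)


def chirp_is_periodic(N):
    """True if h_{n+N} == h_n over the lags used by the Toeplitz matrix."""
    return all(abs(chirp_tap(n + N, N) - chirp_tap(n, N)) < 1e-9
               for n in range(-(N - 1), N))
```

Since h_{n+N} = (-1)^N h_n, the taps repeat for even N, so N recirculating taps suffice, while for odd N the sign alternates and the full 2N - 1 taps are needed.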

In the specific case when N is an odd prime, additional simplification is possible. It is possible to
eliminate the multipliers. The DFT may be written as

$$G_0 = \sum_{n=0}^{N-1} g_n \tag{6}$$

$$G_m = g_0 + \sum_{n=1}^{N-1} g_n W^{nm}, \quad \text{for } m = 1, \ldots, N-1. \tag{7}$$


*  Denotes  either  convolution  or  circular  convolution 


Figure  1.  Chirp-Z-Transform  Implementation  of  the  DFT 
Figure 4. Circular Convolution


Since only nonzero values of n and m occur in the right hand side of Equation (7) it is possible to
use Gauss's analogy between logarithms and indices with respect to a primitive root [7] to replace the
product nm by a primitive root raised to a sum, thus reducing this computation to a correlation between
permuted functions [6]. In matrix notation

$$G' - g_0 \mathbf{1} = F' g' \tag{8a}$$

where

$$\mathbf{1} = (1, 1, \ldots, 1)^T \tag{8b}$$

and the matrix F' can be factored into the three matrices

$$F' = P^T C P, \tag{9}$$

where P is an (N - 1) X (N - 1) permutation matrix, C is an (N - 1) X (N - 1) circulant matrix, P^T is the
transpose of P, and G' and g' are column vectors of size N - 1 derived from G and g by deleting,
respectively, G_0 and g_0. The elements of the matrix C are $C_{nm} = W^{(n+m)_p}$, n, m = 1, ..., N - 1,
with $(n)_p = r^n \bmod N$, where r is a primitive root of N, and the elements of P are

$$P_{nm} = \delta_{(n)_p,\, m}, \quad n, m = 1, \ldots, N-1 \tag{10}$$

where δ is the Kronecker delta and the subscript p implies permuted.

Thus, for the case where N is an odd prime, C is a circulant matrix and can be implemented with a
recirculating transversal filter whose tap weights are

$$h_n = \exp(-j2\pi (n)_p / N), \quad n = 1, \ldots, N-1 \tag{11a}$$

$$\phantom{h_n} = W^{(n)_p}, \quad n = 1, \ldots, N-1 \tag{11b}$$

and the Fourier transform coefficients are given by

$$G_0 = \sum_{n=0}^{N-1} g_n \tag{12a}$$

and

$$G_m = (F' g')_m + g_0, \quad m = 1, \ldots, N-1. \tag{12b}$$


These concepts are illustrated in Figure 5 for N = 5. Thus, the one-dimensional architectures may be
summarized as a time-varying operation, a convolution, and a second time-varying operation, where for arbitrary
N the time-varying operations are multiplication by a diagonal matrix, and where for the special case in which N is
an odd prime, the time-varying operations are multiplication by a permutation matrix and its transpose.
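The permutation-matrix factorization for the N = 5 case can be checked numerically. The sketch below is illustrative and rests on stated assumptions: the primitive root r = 2 is chosen here (3 would serve equally well), the permutation is (n)_p = r^n mod N, and C is taken with elements W raised to (n+m)_p, so that it is constant along anti-diagonals, which is the correlation form. All names are hypothetical.

```python
import cmath

N, r = 5, 2                         # r = 2 is a primitive root of 5 (assumed choice)
W = cmath.exp(-2j * cmath.pi / N)
idx = range(1, N)                   # nonzero indices 1, ..., N - 1

perm = {n: pow(r, n, N) for n in idx}                       # (n)_p = r^n mod N

F_prime = [[W ** (a * b) for b in idx] for a in idx]        # DFT core, zero row/column deleted
C = [[W ** pow(r, n + m, N) for m in idx] for n in idx]     # depends only on (n + m) mod (N - 1)
P = [[1 if perm[n] == m else 0 for m in idx] for n in idx]  # permutation matrix


def matmul(A, B):
    """Plain matrix product of two square lists of lists."""
    size = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def transpose(A):
    return [list(row) for row in zip(*A)]


PtCP = matmul(matmul(transpose(P), C), P)
max_error = max(abs(PtCP[i][j] - F_prime[i][j])
                for i in range(N - 1) for j in range(N - 1))
```

With this indexing the filtering step is a circular correlation of permuted sequences; reversing the order of one permuted sequence turns it into an ordinary circular convolution with the same tap values.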


Figure 5. Example of the Prime Transform when N = 5
IMPLEMENTATION


Many types of transversal filter implementations may be easily accomplished using analog sampled
data techniques [9]. Each offers some advantages and some disadvantages in any particular application,
although they are all architecturally similar. In this paper the implementations to be discussed are surface
acoustic wave (SAW) devices, charge coupled devices (CCD) and hybrid analog-digital correlators.


SURFACE ACOUSTIC WAVE DEVICES

Surface Acoustic Wave (SAW) devices can accept either analog or sampled analog input and the
output is analog. A substrate of piezoelectric material is polished on one face and a pattern of aluminum or
other conductor is deposited by photolithographic techniques. In its simplest configuration, sets of
interdigitated finger electrodes are spaced at the sampling rate distance through the use of the relationship
d = C_R t, where d is the tap spacing, C_R is the Rayleigh wave velocity, and t is the sampling increment. For
a typical substrate of ST-cut quartz, C_R ≈ 3 mm/μsec and for a typical sampling increment of 150 nsec,
d = 0.45 mm and a 150 point DFT can be implemented by recirculating convolution in an active length less
than 75 mm (3 in.) on a 100 mm (4 in.) substrate. A simple real transversal filter is shown in Figure 6 along
with the method of establishing the tap weights. A prototype complex filter [10] is shown packaged in
Figure 7. A similar device has been used as the premultiplication and postmultiplication reference function
generator and double-balanced mixers have been employed as the multipliers.

Physical limitations on the size of available substrates limit the available DFT size. For an active
length of substrate L = 150 mm and a surface wave velocity C_R = 3 mm/μsec the maximum transform
size N_max for a data rate F_s is approximately N_max = L F_s / C_R = 50 F_s with F_s in megahertz. Current SAW
transversal filters operate at sample rates from 1 to 100 MHz with carrier frequencies from 5 to 500 MHz
with typical fractional bandwidths of about 10%. Although there are some piezoelectric semiconductors,
silicon is nonpiezoelectric. Heteroepitaxial materials such as Aluminum Nitride (AlN) and Silicon (Si)
cogrown on a sapphire substrate are currently under development for monolithic integrated circuits where
the AlN is the piezoelectric and Si is the semiconductor. Such monolithic circuits should make possible
monolithic CZTs at 100 to 500 MHz data rates.


Figure 6. Surface Wave Transversal Filter


Figure 7. 32 Complex Tap SAW CZT Filter
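The SAW sizing relations quoted in this section reduce to simple arithmetic, sketched below. The function names are hypothetical, and the velocity of 3 mm/μsec is the round figure used in the text rather than a measured value.

```python
C_R_MM_PER_US = 3.0     # Rayleigh wave velocity used in the text, mm per microsecond


def tap_spacing_mm(sampling_increment_ns):
    """Tap spacing d = C_R * t for a sampling increment given in nanoseconds."""
    return C_R_MM_PER_US * sampling_increment_ns / 1000.0


def active_length_mm(num_taps, sampling_increment_ns):
    """Substrate length occupied by num_taps equally spaced taps."""
    return num_taps * tap_spacing_mm(sampling_increment_ns)


def max_transform_size(data_rate_mhz, substrate_length_mm=150.0):
    """N_max = L * F / C_R, with the data rate F in megahertz."""
    return substrate_length_mm * data_rate_mhz / C_R_MM_PER_US
```

For a 150 nsec sampling increment the tap spacing is 0.45 mm, so 150 taps occupy 67.5 mm, inside the 75 mm active length, and a 150 mm substrate supports about 50 points per megahertz of data rate.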


CHARGE  COUPLED  DEVICES 

CCDs are sampled data analog circuits which can be fabricated by Metal Oxide Semiconductor
(MOS) technology as LSI components [11]. As such they are directly compatible with other MOS circuits.
Current CCD transversal filters have operated as video devices with sample rates up to 5 MHz. CCDs
operate by the manipulation of injected minority carriers in potential wells under MOS capacitors and thus
behave as capacitive reactances with low power dissipation. However, since the potential wells which
contain the minority carriers also attract thermally generated minority carriers, there is a maximum storage
time for the analog signal which depends on the dark current associated with the temperature of the silicon.
Under normal conditions at room temperature, dark currents are tens of nA/cm^2 and storage times of
hundreds of milliseconds can be achieved.

There are many ways in which unidirectional charge transfer can be achieved. The first developed
was a three-phase clocking structure which is illustrated in the transversal filter of Figure 8. The three
electrode CCD structure is planar, much like the SAW devices, and the direction of charge propagation is
determined by the sequence of potentials applied to the three electrodes. Unfortunately, if the minority
carriers are allowed to collect at the semiconductor-oxide boundary, poor charge transfer efficiency will
result due to minority carriers getting caught in trapping sites. This means that the CCD will behave
nonlinearly unless there is sufficient propagating charge present to fill all of the traps. By biasing the
operating condition of the CCD so that about 10% of the dynamic range is used for the injection of a "fat
zero," the traps are kept continuously filled and the device has over a 60 dB dynamic range. In practice,
a video signal representing the signal to be processed is added to a fixed bias somewhat larger than one-half
of the peak-to-peak value of the signal. Since the effective storage time of the device is long relative to the
time required to execute a convolution, CCDs can be considered to be interruptible signal processors and
as such are more compatible with the executive control required for signal processing. A 64 point CCD
filter with discrete cosine transform sine and cosine chirps is shown in Figure 9. This chip was developed
by Texas Instruments for the Naval Undersea Center for image processing. The discrete cosine transform is
described in a subsequent section.


Figure 8. Schematic of the Sampling and Summing Operation

Current research in CCDs is directed toward improving the charge transfer efficiency and removing
the requirement of continuous "fat zero" charge injection by ion implantation techniques which keep the
minority carriers away from the semiconductor oxide boundary. Ion implantation is also being used to
provide asymmetric potential wells so that simpler two-phase clocking can be employed. Currently
available CCDs have 500 stages with 0 transfer efficiency and devices with up to 2000 stages are planned.

Another charge transfer device similar to the CCD is the Bucket Brigade Device (BBD). This is a
sequence of MOS transistors coupled together by diffusion enhanced Miller capacitance. Although these
devices do not operate at frequencies as high as CCDs, they have better low frequency performance since
they include active devices. A CZT has been implemented with two BBD chips. Two 200 tap filters are
implemented on each chip: one a discrete cosine and the other a discrete sine filter. The device, the
complex chirp used in the premultiplier and a typical input and output are shown in Figure 10. The
input is an offset cosine wave and the output shows a D.C. component plus a response at the cosine wave
frequency. These filters can operate at 100 kHz and have tap accuracies better than 1%. With careful
control of geometry, both BBD and CCD filters with tap accuracies approaching 0.1% should be possible. This
chip was also developed by Texas Instruments for the Naval Undersea Center.
Figure 9. 64 Point CCD Filters


A hybrid binary-analog CCD correlator module with 32 analog taps [12] has been made for the
Naval Undersea Center by General Electric and is shown diagrammatically in Figure 11. The maximum
clock  rate  of  the  module  is  4  MHz.  These  modules  may  be  cascaded  to  increase  the  correlation  length  to 
provide  an  analog  versus  multilevel  cross  correlator  with  analog  output.  The  correlator  module  uses  charge 
propagation  through  only  3  stages  since  the  CCD  is  arranged  in  32  stages  of  3  samples  each.  By  transverse 
shifting  of  the  charge  instead  of  longitudinal  shifting,  charge  transfer  inefficiency  degradation  is  avoided 
and  the  modules  can  be  cascaded  to  very  large  sizes.  However,  with  this  configuration,  dark  current  non¬ 
uniformity  must  be  controlled  to  prevent  time  variable  pattern  noise  from  contributing  to  the  output.  Sys¬ 
tematic  errors  in  the  tap  weights  can  be  measured  and  stored  and  the  analog  signal  corrected  for  the  mea¬ 
sured  variation  in  uniformity  before  it  is  stored  in  the  CCD  registers. 


DISCRETE  COSINE  TRANSFORM 

Closely related to the DFT is the discrete cosine transform (DCT). Two different types of DCTs
are useful for reduced redundancy television image transmission. Both are obtained by extending a length
N real data block to have even symmetry, taking the discrete Fourier transform (DFT) of the extended data
block, and saving N terms of the resulting DFT. Since the DFT of a real, even sequence is a real, even
sequence, either DCT is its own inverse if a normalized DFT is used.

The "Odd DCT" (ODCT) extends the length N data block to length 2N-1, with the middle point
of the extended block as a center of even symmetry. The "Even DCT" (EDCT) extends the length N data
block to length 2N, with a center of even symmetry located between the two points nearest the middle.
For example, the odd length extension of the sequence $g_0 g_1 g_2$ is $g_2 g_1 g_0 g_1 g_2$, and the even
length extension is $g_2 g_1 g_0 g_0 g_1 g_2$. In both cases, the symmetrization eliminates the jumps in the
periodic extension of the data block which would occur if one edge of the data block had a high value and
the other edge had a low value: in effect it performs a sort of smoothing operation with no loss of
information. It will be noted that the terms "odd" and "even" in ODCT and EDCT refer only to the length of
the extended data block; in both cases the extended data block has even symmetry. Both types of DCT may
be implemented using compact, high speed, serial access hardware, in structures similar to those previously
described for the chirp-Z transform (CZT) implementation of the DFT.
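The symmetric-extension procedure described above can be checked numerically. The following is a minimal sketch in plain Python, not the hardware method of this report: the function names (`dft`, `odd_extension`, `even_extension`, `odct`, `edct`) are illustrative assumptions, the DFT is left unnormalized, and the extended blocks are stored in periodic order (sample $g_0$ first) rather than centered. The half-sample phase factor in `edct` follows the EDCT definition given later in this section.

```python
import cmath
import math


def dft(x):
    """Unnormalized discrete Fourier transform of a sequence."""
    size = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * math.pi * n * k / size) for n in range(size))
        for k in range(size)
    ]


def odd_extension(g):
    """Length 2N-1 block with even symmetry about g[0], in periodic order."""
    return list(g) + list(g[:0:-1])


def even_extension(g):
    """Length 2N block whose center of symmetry lies between two samples."""
    return list(g) + list(g[::-1])


def odct(g):
    """First N terms of the DFT of the odd-length extension (already real)."""
    n_pts = len(g)
    return [value.real for value in dft(odd_extension(g))[:n_pts]]


def edct(g):
    """First N DFT terms of the even-length extension, phase-corrected to be real."""
    n_pts = len(g)
    spectrum = dft(even_extension(g))
    return [
        (cmath.exp(-1j * math.pi * k / (2 * n_pts)) * spectrum[k]).real
        for k in range(n_pts)
    ]
```

For the three-point example in the text, `odd_extension([g0, g1, g2])` returns the samples in the order $g_0 g_1 g_2 g_2 g_1$, which is the periodic rotation of $g_2 g_1 g_0 g_1 g_2$.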

Let the data sequence be $g_0, g_1, \ldots, g_{N-1}$. The ODCT of $g$ is defined as

$$G_k = \sum_{n=-(N-1)}^{N-1} g_n \, e^{-j2\pi nk/(2N-1)} \quad \text{for } k = 0, 1, \ldots, N-1 \tag{13}$$
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Figure  1  1 .  Schematic  Diagram  of  a  Binary-Analog  CCD  Correlator  Module 

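where $g_{-n} = g_n$ for $n = 0, 1, \ldots, N-1$. By straightforward substitution it may be shown that

$$G_k = 2\,\mathrm{Re}\left\{ \sum_{n=0}^{N-1} \tilde{g}_n \, e^{-j2\pi nk/(2N-1)} \right\} \tag{14}$$

where $\tilde{g}$ is defined by equation (14a):

$$\tilde{g}_n = \begin{cases} 0.5\, g_0, & n = 0 \\ g_n, & n = 1, \ldots, N-1 \end{cases} \tag{14a}$$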


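The identity (15) may be used to obtain the CZT form of the ODCT shown in equation (16).

$$2nk = n^2 + k^2 - (n-k)^2 \tag{15}$$

$$G_k = 2\,\mathrm{Re}\left\{ e^{-j\pi k^2/(2N-1)} \sum_{n=0}^{N-1} \left[ e^{-j\pi n^2/(2N-1)} \, \tilde{g}_n \right] e^{j\pi (n-k)^2/(2N-1)} \right\} \tag{16}$$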


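The EDCT of $g$ is defined by equation (17a), where the extended sequence is defined by equation
(17b).

$$G_k = e^{-j\pi k/(2N)} \sum_{n=-N}^{N-1} g_n \, e^{-j2\pi nk/(2N)} \quad \text{for } k = 0, 1, \ldots, N-1 \tag{17a}$$

$$g_{-1-n} = g_n \quad \text{for } n = 0, 1, \ldots, N-1 \tag{17b}$$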

If the mutually complex conjugate terms in equation (17a) are combined, then equation (18) results.
Equation (18) may be viewed as an alternate way of defining the EDCT.

$$G_k = 2 \sum_{n=0}^{N-1} g_n \cos\left[ \frac{2\pi (n + 0.5) k}{2N} \right] \tag{18}$$

Equation (18) may be put in the CZT format as

$$G_k = 2\,\mathrm{Re}\left\{ e^{-j\pi k/(2N)} \, e^{-j\pi k^2/(2N)} \sum_{n=0}^{N-1} \left[ e^{-j\pi n^2/(2N)} \, g_n \right] e^{j\pi (n-k)^2/(2N)} \right\} \tag{19}$$

Ahmed [13] has investigated the use of the EDCT as a substitute for the Karhunen-Loeve transform
for exponentially correlated data and finds that it is superior to the Fourier transform and is
comparable to the Karhunen-Loeve (K-L) in rate-distortion performance while maintaining the computational
simplicity of a transform which does not depend on the picture statistics. Habibi [14] has shown by
simulation that the DCT is equivalent in a mean-square-error sense to the K-L transform under basis
restriction. A 100 point ODCT for use in a 10 frame per second experimental TV image compression system
has been constructed by the Naval Undersea Center for the Advanced Research Projects Agency (ARPA) using
bucket brigade device transversal filters. The transversal filter has 199 nonzero taps, and both the cosine
filter and sine filter required for complex arithmetic are implemented on a single chip. A microphotograph
of the filters is shown in Figure 12.
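The chirp form of the ODCT (premultiply by a chirp, filter with a chirp, postmultiply by a chirp, take twice the real part) can be compared against the direct cosine sum. This is a minimal sketch under stated assumptions: the names `odct_direct`, `odct_chirp` and `chirp_filter_lags` are hypothetical, the arithmetic is ordinary floating point rather than the analog transversal filters of the report, and the chirp filter is evaluated as an explicit sum instead of a tapped delay line.

```python
import cmath
import math


def odct_direct(g):
    """ODCT as a cosine sum: g0 + 2 * sum_{n>=1} g_n cos(2*pi*n*k / (2N-1))."""
    n_pts = len(g)
    period = 2 * n_pts - 1
    return [
        g[0] + 2.0 * sum(g[n] * math.cos(2 * math.pi * n * k / period)
                         for n in range(1, n_pts))
        for k in range(n_pts)
    ]


def odct_chirp(g):
    """ODCT via chirp premultiplication, chirp filtering and postmultiplication."""
    n_pts = len(g)
    period = 2 * n_pts - 1

    def chirp(m):
        # Discrete chirp exp(+j*pi*m^2 / (2N-1)); its conjugate is the down-chirp.
        return cmath.exp(1j * math.pi * m * m / period)

    halved = [0.5 * g[0]] + list(g[1:])          # first sample is weighted by 0.5
    premultiplied = [halved[n] * chirp(n).conjugate() for n in range(n_pts)]
    output = []
    for k in range(n_pts):
        # Chirp filter: the tap weight depends only on the lag n - k.
        filtered = sum(premultiplied[n] * chirp(n - k) for n in range(n_pts))
        output.append(2.0 * (chirp(k).conjugate() * filtered).real)
    return output


def chirp_filter_lags(n_pts):
    """Distinct lags n - k that the chirp filter must provide."""
    return sorted({n - k for n in range(n_pts) for k in range(n_pts)})
```

The lag `n - k` takes 2N-1 distinct values, so `len(chirp_filter_lags(100))` is 199, which is consistent with the 199 nonzero taps quoted for the 100 point ODCT.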


TWO-DIMENSIONAL  DFT 

The DFT of a two-dimensional array, $g_{n_1,n_2}$, may be computed by successive applications of the
one-dimensional DFT. This concatenation is readily observed by writing the expression for the
two-dimensional DFT as

$$G(k_1,k_2) = \sum_{n_2=0}^{N_2-1} \left( \sum_{n_1=0}^{N_1-1} g(n_1,n_2)\, e^{-j2\pi n_1 k_1/N_1} \right) e^{-j2\pi n_2 k_2/N_2} \tag{20}$$


Figure  12.  Bucket  Brigade  Chirp  Filters 


In two dimensions the concatenation property of the DFT can be exploited along with auxiliary
memory to compute the two-dimensional transform by successively computing the CZT of rows of the
input signal matrix and then using the auxiliary memory as a row to column transformation, i.e.,
transposing the partial Fourier transform matrix, and computing the two-dimensional Fourier transform
with a second CZT.
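The row transform, transpose, row transform sequence described above can be sketched as follows. This is an illustrative Python sketch only: `dft` stands in for the CZT hardware (both compute the same DFT values), `transpose` plays the role of the auxiliary corner-turning memory, and all function names are assumptions rather than names used in the report.

```python
import cmath
import math


def dft(x):
    """Unnormalized one-dimensional DFT (stand-in for a CZT module)."""
    size = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * math.pi * n * k / size) for n in range(size))
        for k in range(size)
    ]


def transpose(matrix):
    """Row-to-column transformation performed by the auxiliary memory."""
    return [list(column) for column in zip(*matrix)]


def dft_2d_rows_then_columns(g):
    """Two-dimensional DFT of g[n1][n2], returned as G[k1][k2]."""
    rows_done = [dft(row) for row in g]                 # transform over n2
    turned = transpose(rows_done)                       # now indexed [k2][n1]
    columns_done = [dft(row) for row in turned]         # transform over n1
    return transpose(columns_done)                      # back to [k1][k2]
```

The second pass reuses the same one-dimensional transform; only the memory access order changes between the two passes.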

An alternative method of computing the two-dimensional DFT is to use linear congruential
scanning of the data. The two-dimensional discrete Fourier transform system will use an input scanning
device and a one-dimensional discrete Fourier transform device as shown in Figure 13. The two-dimensional
transform block size $N_1$ by $N_2$ is chosen such that $N_1$ and $N_2$ are relatively prime integers
(i.e., they must have no common divisor other than 1). The one-dimensional Fourier transform device has a
block length of $N = N_1 N_2$. The purpose of the input scanning device is to so order the input data that
the one-dimensional Fourier transform of the length $N_1 N_2$ serial data string is identical to an $N_1$
by $N_2$ two-dimensional Fourier transform of the $N_1$ by $N_2$ input data samples. If desired, an output
scanning device may also be used to provide the transform output points in normal order. The required scan
may be derived from the representation [15] of a one-dimensional discrete Fourier transform matrix as a
direct product matrix. The one-dimensional DFT in equation (21) is equivalent to the two-dimensional DFT
in equation (22) when $N = N_1 N_2$ and $N_1$, $N_2$ are relatively prime.


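$$G_k = \sum_{n=0}^{N-1} g_n \, e^{-j2\pi nk/N} \quad \text{for } n, k = 0, 1, \ldots, N-1 \tag{21}$$

Figure 13. Two-dimensional Discrete Fourier Transform Device

$$G(k_1,k_2) = \sum_{n_1=0}^{N_1-1} \sum_{n_2=0}^{N_2-1} g(n_1,n_2)\, e^{-j2\pi \left( \frac{n_1 k_1}{N_1} + \frac{n_2 k_2}{N_2} \right)} \tag{22}$$

for $n_1, k_1 = 0, 1, \ldots, N_1 - 1$ and $n_2, k_2 = 0, 1, \ldots, N_2 - 1$.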

In order to make the two transforms equivalent, it is necessary to find a pair of one-to-one functions
$n(n_1, n_2)$ and $k(k_1, k_2)$ such that

$$\frac{nk}{N} = \frac{n_1 k_1}{N_1} + \frac{n_2 k_2}{N_2} \pmod{1} \tag{23}$$

or

$$nk = n_1 k_1 N_2 + n_2 k_2 N_1 \pmod{N_1 N_2} \tag{24}$$

This may be accomplished by letting

$$n(n_1, n_2) = n_1 N_2 + n_2 N_1 \pmod{N} \tag{25}$$

$$k(k_1, k_2) = k_1 U_1 N_2 + k_2 U_2 N_1 \pmod{N} \tag{26}$$

where the constants $U_1$ and $U_2$ are the solutions of

$$N_2 U_1 = 1 \pmod{N_1} \tag{27}$$

$$N_1 U_2 = 1 \pmod{N_2} \tag{28}$$

Equations (27) and (28) will have solutions if and only if $N_1$ and $N_2$ are mutually prime. Under this
condition, the mappings described by equations (25) and (26) will satisfy the requirement of equation (23).

The linear congruential scan prescribed by equation (25) may also be used to perform two-dimensional
convolution or crosscorrelation using an ordinary one-dimensional transversal filter or crosscorrelator.
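The equivalence between the scanned one-dimensional DFT and the two-dimensional DFT can be verified numerically for a small block. The sketch below assumes plain Python with hypothetical helper names (`mod_inverse`, `dft_2d_by_scan`); it uses the input scan of equation (25) and the output map of equation (26), with $U_1$ and $U_2$ found by brute-force search for the modular inverses of equations (27) and (28).

```python
import cmath
import math


def dft_1d(x):
    """Unnormalized one-dimensional DFT."""
    size = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * math.pi * n * k / size) for n in range(size))
        for k in range(size)
    ]


def dft_2d(g):
    """Direct two-dimensional DFT of g[n1][n2], used as the reference."""
    n1_len, n2_len = len(g), len(g[0])
    return [
        [
            sum(
                g[n1][n2]
                * cmath.exp(-2j * math.pi * (n1 * k1 / n1_len + n2 * k2 / n2_len))
                for n1 in range(n1_len)
                for n2 in range(n2_len)
            )
            for k2 in range(n2_len)
        ]
        for k1 in range(n1_len)
    ]


def mod_inverse(a, modulus):
    """Smallest u with a*u = 1 (mod modulus); fails if no inverse exists."""
    for u in range(modulus):
        if (a * u) % modulus == 1 % modulus:
            return u
    raise ValueError("no inverse: arguments are not relatively prime")


def dft_2d_by_scan(g):
    """Two-dimensional DFT obtained from one length N1*N2 one-dimensional DFT."""
    n1_len, n2_len = len(g), len(g[0])
    if math.gcd(n1_len, n2_len) != 1:
        raise ValueError("block dimensions must be relatively prime")
    total = n1_len * n2_len
    u1 = mod_inverse(n2_len, n1_len)   # N2 * U1 = 1 (mod N1)
    u2 = mod_inverse(n1_len, n2_len)   # N1 * U2 = 1 (mod N2)
    serial = [0.0] * total
    for n1 in range(n1_len):
        for n2 in range(n2_len):
            # Input scan: n = n1*N2 + n2*N1 (mod N)
            serial[(n1 * n2_len + n2 * n1_len) % total] = g[n1][n2]
    spectrum = dft_1d(serial)
    # Output scan: k = k1*U1*N2 + k2*U2*N1 (mod N)
    return [
        [spectrum[(k1 * u1 * n2_len + k2 * u2 * n1_len) % total]
         for k2 in range(n2_len)]
        for k1 in range(n1_len)
    ]
```

For a 3 by 4 block the search gives $U_1 = 1$ and $U_2 = 3$; a block with a common factor, such as 2 by 4, is rejected because the scan would no longer be one-to-one.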


TWO-DIMENSIONAL  CZT 

Once the data is in a two-dimensional format, with simultaneous serial access to all the rows, it may
be transformed in the "horizontal" direction by the structure shown in Figure 14. The individual
one-dimensional discrete chirp filters and discrete chirp generator of Figure 14 may be implemented using
CCDs or hybrid digital correlators. An acoustic surface wave device with multiple input taps may be used to
access a column of the partially transformed output in a single shift time of the partial transform device.
With appropriate coding of the surface wave column access device, it may also perform the discrete chirp
premultiplication and the discrete chirp convolution of a DFT in the "vertical" direction. A complete
two-dimensional CZT architecture is shown in Figure 15, and the required coding for the column access wave
device is shown in Figure 16. The complex arithmetic may be implemented as described previously. A
balanced mixer may be used for the fast multiplier required for the vertical transform. Lower speed variable
transconductance multipliers may be used in the horizontal partial transform.


Figure 14. Two-dimensional Partial Chirp-Z Transform

Figure 15. Hybrid Implementation of Two-dimensional CZT

Figure 16. Tap Weights and Structure for Acoustic Surface Wave Combined
Demultiplexer, Chirp Multiplier, and Discrete Chirp Filter
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MODULAR  CZT 


Two methods for combining $N_2$ chirp-Z transform (CZT) modules of length $N_1$ to perform a
discrete Fourier transform (DFT) of length $N_1 N_2$ are described. The first method uses an auxiliary
parallel-input, parallel-output DFT device of size $N_2$ and allows the transform of size $N_1 N_2$ to be
performed in the same time required for a single CZT module to perform a size $N_1$ transform. The second
method uses an auxiliary parallel-input, serial-output DFT device of size $N_2$. If the second method is
implemented entirely in a single technology, such as with CCDs, it performs the size $N_1 N_2$ transform in
$N_2$ times the amount of time required for a single CZT module to perform a size $N_1$ transform; if $N_2$
is a composite number, say $N_2 = M_1 M_2$, the second method also permits the same hardware to perform
$M_1$ simultaneous transforms of length $N_1 M_2$.

A one-dimensional discrete Fourier transform may be written as a partial transform of a doubly
subscripted representation of the data, followed by a pointwise multiplication, followed by a second partial
transform as shown in equations (29) - (34):

$$G_k = \sum_{n=0}^{N-1} g_n \, e^{-j2\pi nk/N} \quad \text{for } k = 0, \ldots, N-1 \text{ and } N = N_1 N_2 \tag{29}$$

[Equations (30) - (34) are illegible in the source; only fragments of index ranges running to $N_1 - 1$ in
(30) and to $N_2 - 1$ in (31) survive.]

Figures 17 and 17a show modular CZT implementations which follow from equation (34). The individual
CZT subsystems shown in Figures 17 and 17a would be similar to the CZT implementations previously
described. The parallel DFT required for the second partial transform in Figure 17 may be implemented as
a combination of summers and attenuators. This is shown in Figure 18 for $N_2 = 2$. In general, the
attenuation factors are complex. A complete double length CZT is shown in Figure 19. Unfortunately, a
parallel DFT implementation of this type becomes unwieldy if the dimension $N_2$ is very large.

Figure 17. Organization of Modular CZT

These equations have been [illegible] a transform device in which signals are [illegible] through
[illegible] delay lines at different speeds. Alternatively, if the factors in equation ([number illegible])
are interpreted as waves propagating in opposite directions relative to the function to be transformed, it
may be seen that the structure of Figure 20 also performs a discrete Fourier transform with speed comparable
to that of a CZT. A surface wave device module which implements the triple product convolutional equation
([number illegible]) has been built by Reeder [16] at United Aircraft. A schematic is shown in Figure 21.
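The modular decomposition described in this section (a first partial transform by $N_2$ modules of length $N_1$, a pointwise multiplication, and a second partial transform of size $N_2$) can be illustrated with a short sketch. Because the index mappings of the original equations are not legible, the mapping used here, $n = N_2 n_1 + n_2$ and $k = k_1 + N_1 k_2$, is an assumption: it is the standard decimation mapping that produces this three-step structure, not necessarily the one printed in the report. The function names are likewise hypothetical.

```python
import cmath
import math


def dft(x):
    """Unnormalized one-dimensional DFT (stand-in for one CZT module)."""
    size = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * math.pi * n * k / size) for n in range(size))
        for k in range(size)
    ]


def modular_dft(g, n1_len, n2_len):
    """Length N1*N2 DFT built from N2 length-N1 transforms plus a size-N2 DFT."""
    total = n1_len * n2_len
    if len(g) != total:
        raise ValueError("input length must equal N1 * N2")
    # First partial transform: module n2 processes samples g[N2*n1 + n2].
    partial = [
        dft([g[n2_len * n1 + n2] for n1 in range(n1_len)])
        for n2 in range(n2_len)
    ]
    # Pointwise multiplication by exp(-j*2*pi*n2*k1 / N).
    weighted = [
        [partial[n2][k1] * cmath.exp(-2j * math.pi * n2 * k1 / total)
         for k1 in range(n1_len)]
        for n2 in range(n2_len)
    ]
    # Second partial transform: a size-N2 DFT across the module outputs.
    output = [0j] * total
    for k1 in range(n1_len):
        across_modules = dft([weighted[n2][k1] for n2 in range(n2_len)])
        for k2 in range(n2_len):
            output[k1 + n1_len * k2] = across_modules[k2]
    return output
```

With $N_2 = 2$ the second partial transform reduces to a sum and a difference of the two weighted module outputs, which matches the remark that a parallel DFT of this size needs only summers and attenuators.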


TWO-DIMENSIONAL DCT

A two-dimensional DCT may be computed as a two-dimensional Fourier transform of a symmetrically
extended data block. In order to minimize the required filter length, the scanning of the data block
which has been symmetrized for the two-dimensional ODCT will now be considered in detail. The data
block is extended using the double mirror symmetry defined in equation (38).

$$g(\pm n_1, \pm n_2) = g(n_1, n_2) \tag{38}$$

The two-dimensional DFT of the extended data block defines the two-dimensional DCT of the original data
block given in equation (39), with $M_1 = 2N_1 - 1$, $M_2 = 2N_2 - 1$.

$$G(k_1,k_2) = \sum_{n_1=-(N_1-1)}^{N_1-1} \; \sum_{n_2=-(N_2-1)}^{N_2-1} g(n_1,n_2)\, e^{-j2\pi \left( \frac{n_1 k_1}{M_1} + \frac{n_2 k_2}{M_2} \right)} \tag{39}$$

Figure 20. Parallel-Input, Serial-Output CZT Using Multi-Port Transversal Filter

Figure 21. Diode-Correlator DFT Module

Since the indices are only defined modulo $M_1$ and $M_2$ respectively, the summation limits in (39) are
really no different from those in (22). If the scan is defined by equations (25) and (26), then the symmetry
of the corresponding one-dimensional sequence is shown in equation (40).

$$f_{-n} = g(-n_1, -n_2) = g(n_1, n_2) = f_n \tag{40}$$

The toroidal scan of the extended data block, which can be obtained by repeatedly scanning points of the
original data block, is illustrated in Table I for a block size of 2 by 3. The numbers in the table indicate
the scan order, while the letters indicate the data values. Similar results have been obtained for a mixed
ODCT by EDCT which permit the use of a square block size. [17]


SIMULTANEOUS  COMPUTATION  OF  THE  DFT  AND  THE  DCT 

The close relationship between the DFT and the DCT permits the use of common modules to
simultaneously compute both transforms. This may be accomplished most simply when an EDCT is
computed using DFT modules. The sum in the EDCT defining equation (18a) may be interpreted as a length
2N DFT of the extension of the function g by N zeros. This leads to the configuration shown in Figure
22. Alternatively, if the odd and even frequencies in the zero-filled DFT are considered separately, they
may be computed using length N DFT modules as shown in Figure 23.
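The zero-fill relationship can be illustrated directly: one length 2N DFT of the data followed by N zeros yields the length N DFT (at the even output frequencies) and, after a phase rotation and a real part, the EDCT (at the first N output frequencies). The sketch below is an assumption-laden illustration in plain Python with hypothetical function names; it mirrors the single-module configuration only in its arithmetic, not in its hardware organization.

```python
import cmath
import math


def dft(x):
    """Unnormalized one-dimensional DFT."""
    size = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * math.pi * n * k / size) for n in range(size))
        for k in range(size)
    ]


def dft_and_edct(g):
    """Return (length-N DFT, EDCT) of g from one zero-filled length-2N DFT."""
    n_pts = len(g)
    zero_filled = list(g) + [0.0] * n_pts
    spectrum = dft(zero_filled)
    # Even frequencies of the zero-filled transform are the length-N DFT.
    dft_n = [spectrum[2 * k] for k in range(n_pts)]
    # First N frequencies, phase-rotated, give the EDCT as twice a real part.
    edct = [
        2.0 * (cmath.exp(-1j * math.pi * k / (2 * n_pts)) * spectrum[k]).real
        for k in range(n_pts)
    ]
    return dft_n, edct
```

Only the output taps and the final phase rotation differ between the two results, which is why a common module can serve both transforms.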
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Figure  22.  Computation  of  the  DFT  and  EDCT  Using  a  Single  Length  2N  DFT  Module 
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CONCLUSION 


It has been shown that a small number of multipurpose modules can provide a highly parallel
structure with minimal control overhead for the computation of many linear transforms. The transforms
include one- and two-dimensional discrete Fourier transforms and discrete cosine transforms. The basic
modules may be chosen to be multipliers, serial access memories, and discrete chirp filters or
crosscorrelators. Current acoustic surface wave and CCD technologies permit small, low power, lightweight,
high speed implementations of the required modules and permit real-time solutions to highly demanding
signal processing tasks including real-time video data compression, [18] spread spectrum communication and
high resolution radar signal processing.
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APPENDIX  H 


INVESTIGATING  ADAPTIVE  TRANSFORMS  IN 
INFORMATION  PROCESSING  APPLICATIONS 


PREFACE 


This  report  describes  work  performed  during  the  five  month 
period  from  June  1,  1975  through  November  27,  1975.  The  investi¬ 
gation  was  funded  by  the  Naval  Undersea  Center,  San  Diego,  Calif¬ 
ornia  under  contract  no.  N  66001-75-0-226  MJE,  and  was  conducted 
at  UCLA  under  the  direction  of  Judea  Pearl  as  Principal  Investi¬ 
gator.  The  team  engaged  in  this  study  consisted  of:  Judea  Pearl, 
Massih  Hamidi  and  Yechiam  Yemini. 

The continuous guidance and encouragement of Harper Whitehouse,
Jeff Speiser and Robert Means of the Naval Undersea Center deserve
the major credit for the accomplishments reported here.


ABSTRACT 


The performances of the Discrete Cosine Transform (DCT) and
the Discrete Fourier Transform (DFT) were analyzed and compared
in signal processing applications. Conditions for the asymptotic
optimality of the DFT and DCT were established. The superiority
of the DCT over the DFT was established for Markov-1 signals. A
tool for studying the asymptotic behavior of transforms was developed
using numerical quadrature analysis.
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SUMMARY  OF  WORK  PERFORMED 


Guided by Naval Undersea Center directives, the emphasis of
our investigation has been to analyze and compare the performance
of two practical transform techniques, the Fourier (DFT) and the
Cosine (DCT) transformation, in image processing applications.

Our study has produced several new results of both theoretical
and practical significance.

1. While it is well known that the Fourier coefficients of stationary
finite-duration continuous signals are asymptotically
uncorrelated, correlation properties of the finite-dimensional,
discrete Fourier transform remained an open question. We have
established the following facts:

a. If the covariance sequence is summable the magnitude of
every off-diagonal covariance element converges to zero
as $N \to \infty$.

b. If the covariance sequence is only square-summable the
magnitude of the covariance elements sufficiently far
from the diagonal converges to zero as $N \to \infty$.

c. If the covariance sequence is square-summable the weak
norm of the matrix containing only the off-diagonal elements
converges to zero as $N \to \infty$.

d. If the covariance sequence is summable the weak norm of
the matrix containing only the off-diagonal elements
converges to zero at least as fast as $N^{-1/2}$.


2. It was conjectured that the performance of the Cosine transform
(DCT) is superior to that of the Fourier transform (DFT).
The fact that the DCT was found more compatible with the hardware
configuration of the Image Processing group at NUC called for
an analytical examination of this conjecture. We were successful
in establishing the following results:

a. The DCT is asymptotically equivalent to the Karhunen-Loeve
transform (KLT) of Markov-1 signals and the rate of convergence
is (similar to the DFT) on the order of $N^{-1/2}$.

b. The DCT offers a better approximation to the KLT of Markov-1
signals than the DFT for all values of $N$ and $\rho$.

c. The DCT is asymptotically equivalent to the KLT of all
finite-order Markov signals.

d. The analysis of asymptotic properties of discrete transforms
can be simplified substantially using numerical
quadrature techniques.

A detailed description of these results is contained in the
following three appendices.
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APPENDIX  I 


ON  THE  RESIDUAL  CORRELATION  OF  FINITE¬ 
DIMENSIONAL  DISCRETE-FOURIER- 
TRANSFORMS  OF  STATIONARY  SIGNALS 

Massih Hamidi
and 

Judea  Pearl 


Published  in  IEEE  Transactions  on 
Information  Theory,  July  1975 
pp.  480-482 
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ABS  TRACT 


The covariance matrix of the Fourier coefficients of N-sampled
stationary random signals is studied. Three theorems are established.

1. If the covariance sequence is summable the magnitude of every
off-diagonal covariance element converges to zero as $N \to \infty$.

2. If the covariance sequence is only square-summable the magnitude
of the covariance elements sufficiently far from the diagonal
converges to zero as $N \to \infty$.

3. If the covariance sequence is square-summable the weak norm of
the matrix containing only the off-diagonal elements converges
to zero as $N \to \infty$.

The rates of convergence are also determined when the covariance
sequence satisfies additional conditions.




1. INTRODUCTION

It is well known that the Fourier coefficients of stationary finite-duration
continuous signals are asymptotically uncorrelated. Root and Pitcher [1]
have shown that the cross-correlation between any two Fourier coefficients
approaches zero as the signal duration grows infinite. Consequently, problems
involving stationary stochastic processes are often treated by approximating
the original processes by Fourier series with uncorrelated random coefficients.

In applications involving digital signal processing the continuous signals
are sampled at a finite number N of equally spaced points in time and are
treated as N-dimensional vectors. Root and Pitcher's results coupled with the
computational convenience of the Fast-Fourier-Transform [2] gave rise to a
number of applications whereby the finite Fourier transform of the signal vector
is taken and its components are treated as uncorrelated random variables. For
example, in digital transmission of pictures and voice it is a common practice
to assign each Fourier component a digital code which is independent of the
magnitude of the other Fourier coefficients [3].

The Fourier coefficients of a signal N-vector $X_N = (x_0, x_1, \ldots, x_{N-1})$
are uncorrelated if and only if the covariance matrix of $X_N$ is a circulant
matrix [4]; i.e., if $E(x_i \bar{x}_j)$ is a function only of $(i-j) \bmod N$. Since such
circular symmetry is very rare in actual processes the covariance matrix of
the Fourier coefficients will contain off-diagonal elements whose magnitudes
affect system performance.

It is generally believed that the magnitude of each off-diagonal element
and their cumulative effect both converge to zero as $N \to \infty$. In this paper we
establish conditions under which these convergences take place and derive
expressions for the rate at which the residual cross-correlation decays with N.


2. THEOREMS AND PROOFS

Consider a sampled sequence $X_N = [x_0, x_1, \ldots, x_{N-1}]^T$ of a stationary
stochastic process with a Toeplitz correlation matrix $T_N = E\{X_N X_N^*\}$ satisfying

$$[T_N]_{i,j} = t(|i-j|), \quad i,j = 0, 1, \ldots, N-1$$

where $\bar{x}$ designates the complex conjugate of $x$.

The discrete-Fourier-transform (DFT) of $X_N$ is a vector $Y_N = [y_0, y_1, \ldots, y_{N-1}]^T$
defined by $Y_N = F_N X_N$

where

$$[F_N]_{m,n} = N^{-1/2}\, W_N^{mn}, \quad m,n = 0, 1, \ldots, N-1$$

and

$$W_N = e^{i2\pi/N}$$

We study the behavior of the off-diagonal elements of the matrix

$$T'_N = E\{Y_N Y_N^*\} = F_N T_N F_N^*$$

for large N, as well as the norm of $T'_N - C_N$ where

$$C_N = \mathrm{diag}\{E(|y_0|^2),\, E(|y_1|^2),\, \ldots,\, E(|y_{N-1}|^2)\}$$

The motivation for studying this norm lies in the fact that in many signal
processing applications the performance degradation caused by the residual
correlation can be upper bounded by monotonic decreasing functions of $|T'_N - C_N|$.

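Theorem 1

If $t(u)$ is summable then $\lim_{N \to \infty} E\{y_m \bar{y}_n\} = 0$ for $m \ne n$.

Moreover,

$$\lim_{N \to \infty} E\{|y_m|^2\} = 2 \sum_{u=0}^{\infty} t(u) - t(0) = \sum_{u=-\infty}^{\infty} t(u)$$

Proof: By definition

$$E\{y_m \bar{y}_n\} = \frac{1}{N} \sum_{l=0}^{N-1} \sum_{r=0}^{N-1} W_N^{ml-nr}\, t(l-r) = \frac{1}{N} \sum_{l=0}^{N-1} W_N^{(m-n)l} \sum_{u=l-(N-1)}^{l} W_N^{nu}\, t(u)$$

In order to obtain an expression with only a single summation we use Abel's Transformation
Formula for Partial Summation [reference number illegible]:

$$\sum_{k=0}^{K} a_k b_k = A_K b_K - \sum_{k=0}^{K-1} A_k \,(b_{k+1} - b_k)$$

where

$$A_k = \sum_{j=0}^{k} a_j \quad \text{for } k \ge 0 \text{ and } A_{-1} = 0$$

Letting $a_l = W_N^{(m-n)l}$ and $b_l = \sum_{u=l-(N-1)}^{l} W_N^{nu}\, t(u)$, and using $t(-u) = t(u)$ and
$W_N^N = 1$, we obtain

$$N E\{y_m \bar{y}_n\} = \sum_{l=0}^{N-1} \left( \sum_{k=0}^{l-1} W_N^{k(m-n)} \right) W_N^{nl}\, [t(N-l) - t(l)] \tag{1}$$

for $m \ne n$, and:

$$N E\{|y_m|^2\} = \sum_{l=0}^{N-1} l\, W_N^{ml}\, [t(N-l) - t(l)] + N \sum_{u=0}^{N-1} W_N^{mu}\, t(u) \tag{2}$$

From equation (1):

$$|E\{y_m \bar{y}_n\}| \le N^{-1} \left| \sum_{l=0}^{N-1} \left( \sum_{k=0}^{l-1} W_N^{k(m-n)} \right) W_N^{nl}\, t(l) \right| + N^{-1} \left| \sum_{l=0}^{N-1} \left( \sum_{k=0}^{l-1} W_N^{k(m-n)} \right) W_N^{nl}\, t(N-l) \right|$$

The first term can be bounded by $N^{-1} \sum_{l=0}^{N-1} l\,|t(l)|$.

For the second term we use (for $m \ne n$)

$$\sum_{k=0}^{N-1} W_N^{k(m-n)} = 0 = \sum_{k=0}^{N-u-1} W_N^{k(m-n)} + \sum_{k=N-u}^{N-1} W_N^{k(m-n)}$$

so that the magnitude of the partial sum of the first $l$ terms is at most $N - l$, and obtain

$$N^{-1} \left| \sum_{l=0}^{N-1} \left( \sum_{k=0}^{l-1} W_N^{k(m-n)} \right) W_N^{nl}\, t(N-l) \right| \le N^{-1} \sum_{u=1}^{N} u\,|t(u)|$$

Hence:

$$|E\{y_m \bar{y}_n\}| \le 2 N^{-1} \sum_{l=0}^{N} l\,|t(l)|$$

Since $t(l)$ is summable, given $\epsilon > 0$ there exists an integer $M$ such that

$$\sum_{l=M}^{\infty} |t(l)| < \frac{\epsilon}{4}$$

Thus:

$$|E\{y_m \bar{y}_n\}| \le 2 \left[ N^{-1} \sum_{l=0}^{M-1} l\,|t(l)| + \sum_{l=M}^{N} |t(l)| \right]$$

Now choosing N large enough to have $N^{-1} \sum_{l=0}^{M-1} l\,|t(l)| < \epsilon/4$ establishes the
first part of our proof.

From equation (2), we obtain, after some computation

$$E\{|y_m|^2\} = \sum_{u=0}^{N-1} W_N^{mu}\, t(u) + \sum_{u=1}^{N} W_N^{-mu}\, t(u) - N^{-1} \left[ \sum_{u=0}^{N-1} u\, W_N^{mu}\, t(u) + \sum_{u=1}^{N} u\, W_N^{-mu}\, t(u) \right]$$

Taking the limit of each term, we have:

$$\left| \lim_{N \to \infty} N^{-1} \sum_{u=0}^{N} u\, W_N^{-mu}\, t(u) \right| \le \lim_{N \to \infty} N^{-1} \sum_{u=0}^{N} u\,|t(u)| = 0$$

Also, it can be shown that

$$\lim_{N \to \infty} \sum_{u=0}^{N} W_N^{mu}\, t(u) = \sum_{u=0}^{\infty} t(u) \tag{3}$$

Hence:

$$\lim_{N \to \infty} E\{|y_m|^2\} = 2 \sum_{u=0}^{\infty} t(u) - t(0) = \sum_{u=-\infty}^{\infty} t(u)$$

Q.E.D.

To show that (3) is valid notice that, $t(u)$ being summable, given any $\epsilon > 0$
there exists an integer $K$ such that

$$\left| \sum_{u=K}^{\infty} t(u) \right| < \frac{\epsilon}{2} \quad \text{and} \quad \sum_{u=K}^{\infty} |t(u)| < \frac{\epsilon}{2}$$

Since $W_N^{mu} \to 1$ as $N \to \infty$ for each fixed $u < K$,

$$\left| \sum_{u=0}^{\infty} t(u) - \lim_{N \to \infty} \sum_{u=0}^{N} W_N^{mu}\, t(u) \right| \le \left| \sum_{u=0}^{\infty} t(u) - \sum_{u=0}^{K-1} t(u) \right| + \left| \lim_{N \to \infty} \sum_{u=K}^{N} W_N^{mu}\, t(u) \right|$$

$$\le \left| \sum_{u=K}^{\infty} t(u) \right| + \lim_{N \to \infty} \sum_{u=K}^{N} |t(u)| < \epsilon$$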

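Theorem 2

If $t(l)$ is square summable, $\lim_{N \to \infty} E\{y_m \bar{y}_n\} = 0$ for all elements such
that $N^{-1}|m-n| > \epsilon > 0$.

Proof:

From equation (1), summing the geometric series over $k$:

$$E\{y_m \bar{y}_n\} = N^{-1} \left[ 1 - W_N^{(m-n)} \right]^{-1} \sum_{l=0}^{N-1} \left[ 1 - W_N^{l(m-n)} \right] W_N^{nl}\, [t(N-l) - t(l)]$$

Applying the Cauchy-Schwarz inequality:

$$|E\{y_m \bar{y}_n\}| \le N^{-1} \left| 1 - W_N^{(m-n)} \right|^{-1} \left( \sum_{l=0}^{N-1} \left| 1 - W_N^{l(m-n)} \right|^2 \right)^{1/2} \left( \sum_{l=0}^{N-1} |t(N-l) - t(l)|^2 \right)^{1/2}$$

$$\le 2^{3/2} \left| 1 - e^{i(m-n)2\pi/N} \right|^{-1} \left( N^{-1} \sum_{l=0}^{N} |t(l)|^2 \right)^{1/2}$$

Hence, whenever $N^{-1}|m-n| > \epsilon > 0$,

$$\lim_{N \to \infty} E\{y_m \bar{y}_n\} = 0$$

Q.E.D.

Theorem 3

$$\lim_{N \to \infty} |T'_N - C_N| = \lim_{N \to \infty} \left[ N^{-1} \sum_{\substack{m,n=0 \\ m \ne n}}^{N-1} |E\{y_m \bar{y}_n\}|^2 \right]^{1/2} = 0, \quad \text{if } t(u) \text{ is square summable.}$$

Proof:

From equation (1) we obtain with some computations [5]

$$\sum_{\substack{m,n=0 \\ m \ne n}}^{N-1} |E\{y_m \bar{y}_n\}|^2 = N^{-1} \sum_{l=0}^{N-1} (Nl - l^2)\, |t(N-l) - t(l)|^2$$

Hence

$$|T'_N - C_N|^2 = N^{-1} \sum_{\substack{m,n=0 \\ m \ne n}}^{N-1} |E\{y_m \bar{y}_n\}|^2 = N^{-2} \sum_{l=0}^{N-1} (Nl - l^2)\, |t(N-l) - t(l)|^2 \le 4 N^{-2} \sum_{l=0}^{N} (Nl - l^2)\, |t(l)|^2$$

Let $\epsilon > 0$, arbitrary, be given. Since $t(l)$ is square summable there exists
an integer $P$ such that $\sum_{l=P}^{\infty} |t(l)|^2 < \epsilon^2/8$. Thus

$$|T'_N - C_N|^2 \le 4 \left[ N^{-2} \sum_{l=0}^{P-1} (Nl - l^2)\, |t(l)|^2 + \sum_{l=P}^{N} |t(l)|^2 \right]$$

Choosing N large enough to have $N^{-2} \sum_{l=0}^{P-1} (Nl - l^2)\, |t(l)|^2 < \epsilon^2/8$ yields

$$|T'_N - C_N| < \epsilon$$

Q.E.D.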


The convergence rates of both the off-diagonal elements and the norm of
$T'_N - C_N$ can be obtained under certain conditions from the proofs of theorems
1-3. These will be listed in the following three corollaries:

Corollary 1 - If $l\,t(l)$ is summable then $E\{y_m \bar{y}_n\} = O(N^{-1})$ for $m \ne n$.

Corollary 2 - If $t(l)$ is square summable then $E\{y_m \bar{y}_n\} = O(N^{-1/2})$ for
all $|m-n| = O(N)$.

Corollary 3 - If $t(l)$ is square summable then $|T'_N - C_N| = O(N^{-1/2})$.

Note that these conditions on $t(l)$ are satisfied for most processes
encountered in practice, e.g., finite order Markoff or moving-average processes.
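The decay of the residual correlation with block length can be observed numerically for a first-order Markov covariance $t(u) = \rho^{|u|}$. The following sketch is an illustration under stated assumptions, not part of the published proofs: the function names are hypothetical, $\rho = 0.5$ is an arbitrary choice, and the covariance of the Fourier coefficients is computed by brute force from the double sum $N^{-1} \sum_l \sum_r W_N^{ml-nr}\, t(|l-r|)$.

```python
import cmath
import math


def dft_coefficient_covariance(t, size):
    """Matrix of E{y_m conj(y_n)} for a stationary sequence with covariance t(u)."""
    return [
        [
            sum(
                cmath.exp(2j * math.pi * (m * l - n * r) / size) * t(abs(l - r))
                for l in range(size)
                for r in range(size)
            ) / size
            for n in range(size)
        ]
        for m in range(size)
    ]


def off_diagonal_weak_norm(cov):
    """Weak norm of the matrix that keeps only the off-diagonal elements."""
    size = len(cov)
    energy = sum(
        abs(cov[m][n]) ** 2
        for m in range(size)
        for n in range(size)
        if m != n
    )
    return math.sqrt(energy / size)


def largest_off_diagonal(cov):
    """Largest magnitude among the off-diagonal covariance elements."""
    size = len(cov)
    return max(abs(cov[m][n]) for m in range(size) for n in range(size) if m != n)


def markov_covariance(rho):
    """Covariance sequence t(u) = rho**|u| of a Markov-1 process."""
    return lambda u: rho ** abs(u)
```

Doubling the block length from 8 to 16 with $\rho = 0.5$ reduces the off-diagonal weak norm, and each off-diagonal element stays below the bound $2N^{-1} \sum_l l\,|t(l)|$ used in the proof of Theorem 1.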

3.  CONCLUSIONS 

The theorems and corollaries established in this paper constitute a
generalization of Root and Pitcher's results to the case of discrete-time signals
and provide guidelines for selecting the proper block length N in
Fourier-signal-processing applications.
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APPENDIX  II 


COMPARISON OF THE COSINE AND FOURIER
TRANSFORMS OF MARKOV-1 SIGNALS

Massih  Hamidi 
and 

Judea  Pearl 


Submitted  for  publication  in  IEEE  Transactions  on 
Acoustics,  Speech,  and  Signal  Processing 


I.  INTRODUCTION 


In a recent paper Ahmed, et al. [1] propose a new transform called the Discrete
Cosine Transform (DCT) and, based on empirical evidence, conjecture that its
performance is closer to the optimal Karhunen-Loeve transform (KLT) than the other
commonly used transforms (i.e., Discrete Fourier, Walsh-Hadamard, Haar). Means,
et al. [2] actually used the DCT for encoding TV pictures in real time.

Pearl showed [3] that for a signal statistic characterized by a covariance
matrix $T$, $|T - T_U|^2$ (to be defined later) constitutes a measure of performance
for a transform $U$, in the sense that the error bounds (in coding and filtering)
are increasing functions of $|T - T_U|^2$.

The purpose of this investigation is to determine the relation between
$|T - T_C|^2$ (the norm obtained using the discrete cosine transform) and $|T - T_F|^2$
(the norm obtained using the discrete Fourier transform (DFT)), thus testing the
conjecture of Ahmed and his collaborators.


II.  DEFINITIONS  AND  NOMENCLATURE 

Let $T$ be a Toeplitz matrix and $U$ an orthogonal transform. Let $T' = UTU^{-1}$ be the representation of $T$ in the new basis, and $T'_D = \mathrm{diag}(T'_{11}, T'_{22}, \ldots, T'_{MM})$. Define $T_U$ to be the representation of $T'_D$ in the first basis, i.e.,

$$T_U = U^{-1}\, T'_D\, U$$

and $|T - T_U|^2$ the Hilbert-Schmidt norm of $T - T_U$, i.e.,

$$|T - T_U|^2 = \frac{1}{M} \sum_{m,n=0}^{M-1} \left| (T - T_U)_{mn} \right|^2 .$$

The cosine transform representation of a Toeplitz matrix $T$ is given by $CTC^{-1}$ where $C$ is an $M \times M$ matrix defined by

$$C_{0j} = \frac{\sqrt{2}}{M}, \qquad j = 0, 1, \ldots, M-1$$

$$C_{kj} = \frac{2}{M} \cos\frac{(2j+1)k\pi}{2M}, \qquad k = 1, 2, \ldots, M-1, \quad j = 0, 1, \ldots, M-1 .$$

A simple algebraic manipulation shows:

$$CC^* = C^*C = \frac{2}{M}\, I$$

(where $X^*$ indicates the complex conjugate transpose of $X$). Hence:

$$C^{-1} = \frac{M}{2}\, C^* .$$

In contrast, the DFT is defined by a unitary matrix $F$ where

$$F_{kj} = \frac{1}{\sqrt{M}} \exp\left( i\, \frac{2\pi}{M}\, kj \right), \qquad k, j = 0, 1, \ldots, M-1 .$$
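The two transform matrices can be built and checked numerically. The following is a minimal sketch, assuming the DCT normalization of Ahmed et al. ($C_{0j} = \sqrt{2}/M$, $C_{kj} = (2/M)\cos[(2j+1)k\pi/2M]$), which is the normalization consistent with $C^{-1} = (M/2)C^*$; the function names are illustrative and not part of the original report.

```python
import numpy as np

def dct_matrix(M):
    """DCT matrix with the scaling assumed here, so that C C^T = (2/M) I."""
    j = np.arange(M)
    k = np.arange(M).reshape(-1, 1)
    C = (2.0 / M) * np.cos((2 * j + 1) * k * np.pi / (2 * M))
    C[0, :] = np.sqrt(2.0) / M          # the k = 0 row has its own constant
    return C

def dft_matrix(M):
    """Unitary DFT matrix F_kj = exp(2*pi*i*k*j/M) / sqrt(M)."""
    idx = np.arange(M)
    return np.exp(2j * np.pi * np.outer(idx, idx) / M) / np.sqrt(M)

M = 8
C = dct_matrix(M)
F = dft_matrix(M)

# Largest deviations from the three identities stated in the text.
dct_gram_error = np.abs(C @ C.T - (2.0 / M) * np.eye(M)).max()
dct_inverse_error = np.abs(np.linalg.inv(C) - (M / 2.0) * C.T).max()
dft_unitary_error = np.abs(F @ F.conj().T - np.eye(M)).max()
print(dct_gram_error, dct_inverse_error, dft_unitary_error)
```

Because $CTC^{-1}$ is unchanged when $C$ is multiplied by a constant, an orthonormal scaling of the same cosine matrix gives the same transformed covariance.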


III. COMPARISON OF $|T - T_C|^2$ AND $|T - T_F|^2$

For any orthogonal matrix $U$ (e.g. $U = C$ or $U = F$) we have

$$|T - T_U|^2 = |T' - T'_D|^2 = |T'|^2 - |T'_D|^2 = |T|^2 - \frac{1}{M} \sum_{m=0}^{M-1} \left| (UTU^{-1})_{mm} \right|^2$$

i.e., the higher the norm of the diagonal vector of the transformed matrix, the lower $|T - T_U|$ and the better the transform. Hence, to compare $|T - T_C|^2$ and $|T - T_F|^2$ it suffices to compare

$$\sum_{m=0}^{M-1} \left| (CTC^{-1})_{mm} \right|^2 \quad \text{and} \quad \sum_{m=0}^{M-1} \left| (FTF^{-1})_{mm} \right|^2 .$$

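We consider matrices of the form

$$T = \begin{pmatrix} 1 & \rho & \rho^2 & \cdots & \rho^{M-1} \\ \rho & 1 & \rho & \cdots & \rho^{M-2} \\ \vdots & & \ddots & & \vdots \\ \rho^{M-1} & \rho^{M-2} & \cdots & \rho & 1 \end{pmatrix}$$

which represent covariance matrices of Markov-1 signals, with $0 \le \rho \le 1$ being the covariance coefficient between adjacent samples.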
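The comparison can be carried out numerically for any given $M$ and $\rho$. The sketch below is an illustration under stated assumptions: it uses orthonormal scalings of the DCT and DFT matrices (the scaling does not affect $UTU^{-1}$), the weak norm with its $1/M$ factor, and helper names that are hypothetical.

```python
import numpy as np

def markov1_cov(M, rho):
    """Markov-1 covariance matrix with entries rho**|i-j|."""
    idx = np.arange(M)
    return rho ** np.abs(idx[:, None] - idx[None, :])

def weak_norm_sq(A):
    """Squared Hilbert-Schmidt norm with the 1/M normalization."""
    return np.sum(np.abs(A) ** 2) / A.shape[0]

def orthonormal_dct(M):
    j = np.arange(M)
    k = np.arange(M)[:, None]
    C = np.sqrt(2.0 / M) * np.cos((2 * j + 1) * k * np.pi / (2 * M))
    C[0, :] = 1.0 / np.sqrt(M)
    return C

def unitary_dft(M):
    idx = np.arange(M)
    return np.exp(2j * np.pi * np.outer(idx, idx) / M) / np.sqrt(M)

def residual_norm_sq(T, U):
    """|T - T_U|^2 = |T|^2 - (1/M) * sum_m |(U T U^*)_mm|^2 for unitary U."""
    d = np.diag(U @ T @ U.conj().T)
    return weak_norm_sq(T) - np.sum(np.abs(d) ** 2) / T.shape[0]

M, rho = 16, 0.9
T = markov1_cov(M, rho)
res_dct = residual_norm_sq(T, orthonormal_dct(M))
res_dft = residual_norm_sq(T, unitary_dft(M))
print("DCT residual:", res_dct, "DFT residual:", res_dft)
```

The same residual can also be obtained directly as the weak norm of the off-diagonal part of $UTU^*$, which is a useful cross-check on the identity.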

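Clearly, for $\rho = 0$ and $\rho = 1$ the cosine and Fourier transforms are equivalent, since $T$ is diagonal in both representations. For an intermediate value of $\rho$ we obtained:

$$(CTC^{-1})_{00} = \frac{1+\rho}{1-\rho} - \frac{2\rho\,(1-\rho^{M})}{M(1-\rho)^2}$$

and

$$(CTC^{-1})_{mm} = \frac{1-\rho^2}{1 - 2\rho\cos\alpha + \rho^2} - \frac{2\rho \left(1 - (-1)^m \rho^{M}\right)(1-\rho)^2\,(1+\cos\alpha)}{M \left(1 - 2\rho\cos\alpha + \rho^2\right)^2}$$

for $m \neq 0$, where $\alpha = m\pi/M$.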
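Closed forms of this kind are easy to check against a direct matrix computation. The sketch below assumes $\alpha = m\pi/M$ and uses expressions for the diagonal elements that were rederived from the definitions of $C$ and $T$; they are written out in `dct_diagonal_closed_form` and should be read as a reconstruction rather than as a transcription of the original typeset formulas.

```python
import numpy as np

def dct_diagonal_numeric(M, rho):
    """Diagonal of C T C^{-1} computed directly (orthonormal DCT scaling)."""
    idx = np.arange(M)
    T = rho ** np.abs(idx[:, None] - idx[None, :])
    k = idx[:, None]
    C = np.sqrt(2.0 / M) * np.cos((2 * idx + 1) * k * np.pi / (2 * M))
    C[0, :] = 1.0 / np.sqrt(M)
    return np.diag(C @ T @ C.T)

def dct_diagonal_closed_form(M, rho):
    """Assumed closed form of the same diagonal, with alpha = m*pi/M."""
    d = np.empty(M)
    d[0] = (1 + rho) / (1 - rho) - 2 * rho * (1 - rho ** M) / (M * (1 - rho) ** 2)
    m = np.arange(1, M)
    alpha = m * np.pi / M
    D = 1 - 2 * rho * np.cos(alpha) + rho ** 2
    sign = np.where(m % 2 == 0, 1.0, -1.0)          # (-1)**m
    d[1:] = (1 - rho ** 2) / D - (
        2 * rho * (1 - sign * rho ** M) * (1 - rho) ** 2 * (1 + np.cos(alpha))
    ) / (M * D ** 2)
    return d

# Largest disagreement over a few block sizes and correlation values.
max_gap = max(
    np.abs(dct_diagonal_numeric(M, r) - dct_diagonal_closed_form(M, r)).max()
    for M in (4, 8, 16)
    for r in (0.3, 0.9)
)
print("max |numeric - closed form| =", max_gap)
```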

An  elementary  (but  tedious)  computation,  leads  to: 


M-1  , 

Z  KCTC'b 
n»=o 


M(l+p^)  4p^  . 

,  2  ■  /, 

1  -  p  (1-P  } 


«  /  1  2M\  2  n 

/S  •  [3(l+p^)  +  4p] 


M^(l-P)^ 


for  M  =  2k  ,  k  >  1  (i.e.  M  even  ^  4}  . 
Combined with

$$M\,|T|^2 = \frac{M(1+\rho^2)}{1-\rho^2} - \frac{2\rho^2\,(1-\rho^{2M})}{(1-\rho^2)^2}$$

we finally obtain the desired norm:
2


2..^n  ^  0^") .  2p^(i  - 
(1  -  M(1  - 


O  An^ / 7  ^ \ ^ 

(3  +  4p  +  3p^)  + 

M^d  -p)^ 


Note that

$$\lim_{M \to \infty} |T - T_C| = 0$$

implying that the DCT is asymptotically equivalent to the KLT of Markov-1 processes. Moreover, since for large $M$ and $\rho \neq 1$ we have

$$|T - T_C| = \frac{\sqrt{2}\,\rho}{1-\rho^2}\; O(M^{-1/2})$$

we conclude that the degradation in performance in filtering and coding vanishes like $M^{-1/2}$.

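In order to calculate $|T - T_F|^2$ recall that for $T_{ij} = t(|i-j|)$ we have

$$(T - T_F)_{ij} = \frac{|i-j|}{M} \left[ t(|i-j|) - t(M - |i-j|) \right]$$

and substituting $t(|i-j|) = \rho^{|i-j|}$ we obtain:

$$M\,|T - T_F|^2 = \frac{2\rho^2\,(1+\rho^{2M})}{(1-\rho^2)^2} - \frac{2(1+\rho^2)\,\rho^2\,(1-\rho^{2M})}{M(1-\rho^2)^3} - \frac{\rho^{M}(M^2-1)}{3} .$$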
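The Fourier case can be verified in a few lines. This sketch assumes a unitary DFT matrix, the elementwise error $(T - T_F)_{ij} = (k/M)[t(k) - t(M-k)]$ with $k = |i-j|$, and the closed form for $M|T - T_F|^2$ written out in the code; both expressions follow from averaging $T$ along its wrapped diagonals and are stated here as assumptions to be tested.

```python
import numpy as np

M, rho = 12, 0.8
idx = np.arange(M)
lag = np.abs(idx[:, None] - idx[None, :])
T = rho ** lag                                   # t(k) = rho**k

# T_F: keep only the diagonal of F T F^* and transform back.
F = np.exp(2j * np.pi * np.outer(idx, idx) / M) / np.sqrt(M)
T_F = (F.conj().T @ np.diag(np.diag(F @ T @ F.conj().T)) @ F).real

# Elementwise closed form of the error matrix.
closed_form = (lag / M) * (rho ** lag - rho ** (M - lag))
dft_error_gap = np.abs((T - T_F) - closed_form).max()

# Squared weak norm and the closed form of M * |T - T_F|^2.
residual_sq = np.sum((T - T_F) ** 2) / M
x = rho ** 2
closed_norm = (
    2 * x * (1 + x ** M) / (1 - x) ** 2
    - 2 * x * (1 + x) * (1 - x ** M) / (M * (1 - x) ** 3)
    - rho ** M * (M ** 2 - 1) / 3
)
print(dft_error_gap, M * residual_sq, closed_norm)
```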


It shows that the asymptotic behavior of $|T - T_F|$ for large $M$ is identical to that of $|T - T_C|$. Thus, the performance difference between the DCT and
the  DFT  must  vanish  like  M'^  .  Indeed,  for  large  M  one  obtains  the  posi¬ 
tive  difference 


IT  - 


IT 


'V 


M^(l  -  P^)(l  +  P)^ 


P  <  1 


indicating that the cosine transform is closer to optimal than the Fourier transform over the entire range of $0 < \rho < 1$.

For moderate values of $M$ we should examine the expressions for $|T - T_F|^2$ and $|T - T_C|^2$ over the range $0 \le \rho \le 1$. The two are plotted, in a normalized form, in Figure 1. We chose $|T - I|^2$ as a common normalizing factor, where $I$ is the identity matrix, and so

$$|T - I|^2 = \frac{2\rho^2}{M(1-\rho^2)^2} \left[ M - 1 - M\rho^2 + \rho^{2M} \right] .$$

It measures the degree of cross-correlation contained in the unprocessed signal, and therefore the maximum amount of decorrelation that can be accomplished by any transform (i.e. the KLT). The ratio

$$\frac{|T - T_U|^2}{|T - I|^2}$$

represents the fractional correlation left 'undone' by a transformation $U$.

Figure 1 shows that for $M = 8, 16, 64$, and for the entire range of $0 < \rho < 1$, $|T - T_F|^2$ is higher than $|T - T_C|^2$. The differences between the two are quite noticeable, occasionally reaching a ratio of 2 : 1.
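The normalized comparison can be tabulated numerically for the block sizes discussed. The sketch below is illustrative only: the sample values of $\rho$ are arbitrary choices, the transforms use orthonormal scalings, and the closed form for $|T - I|^2$ in `identity_distance_sq` is the expression assumed for the normalizing factor.

```python
import numpy as np

def markov1_cov(M, rho):
    idx = np.arange(M)
    return rho ** np.abs(idx[:, None] - idx[None, :])

def weak_norm_sq(A):
    return np.sum(np.abs(A) ** 2) / A.shape[0]

def orthonormal_dct(M):
    j = np.arange(M)
    k = np.arange(M)[:, None]
    C = np.sqrt(2.0 / M) * np.cos((2 * j + 1) * k * np.pi / (2 * M))
    C[0, :] = 1.0 / np.sqrt(M)
    return C

def unitary_dft(M):
    idx = np.arange(M)
    return np.exp(2j * np.pi * np.outer(idx, idx) / M) / np.sqrt(M)

def fractional_residual(T, U):
    """|T - T_U|^2 / |T - I|^2: the share of correlation left undone by U."""
    Tp = U @ T @ U.conj().T
    off = Tp - np.diag(np.diag(Tp))
    return weak_norm_sq(off) / weak_norm_sq(T - np.eye(T.shape[0]))

def identity_distance_sq(M, rho):
    """Assumed closed form of |T - I|^2 for the Markov-1 covariance."""
    return 2 * rho ** 2 * (M - 1 - M * rho ** 2 + rho ** (2 * M)) / (
        M * (1 - rho ** 2) ** 2
    )

table = {}
for M in (8, 16, 64):
    for rho in (0.3, 0.6, 0.9):
        T = markov1_cov(M, rho)
        table[(M, rho)] = (
            fractional_residual(T, orthonormal_dct(M)),
            fractional_residual(T, unitary_dft(M)),
        )

for (M, rho), (r_dct, r_dft) in sorted(table.items()):
    print(f"M={M:3d} rho={rho:.1f}  DCT {r_dct:.4f}  DFT {r_dft:.4f}")
```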

CONCLUSIONS 

We established that the DCT is asymptotically equivalent to the KLT of Markov-1 signals and demonstrated that the rate of convergence is on the order of $M^{-1/2}$. $|T - T_C|^2$ is shown to be smaller than $|T - T_F|^2$ for all values of $M$ and $\rho$, i.e., the Discrete Cosine Transform offers a better approximation to the Karhunen-Loève transform of Markov-1 signals than the Discrete Fourier Transform.

I. INTRODUCTION


Discrete unitary transforms are used extensively in digital signal processing. Both voice and images were successfully encoded and filtered using Fourier, Cosine, Walsh, Haar, and Karhunen-Loève expansions. All unitary transformations are information preserving and no bandwidth reduction results from the application of the transform to the signal. Instead, its beneficial effect lies in redistributing the variance associated with each signal sample into almost uncorrelated variables, thus permitting the transform coefficients to be processed (e.g., coded or filtered) individually. When the signal statistic is known one can find an optimum transformation, the Karhunen-Loève transform (KLT), which totally decorrelates the transform coefficients and results in an optimal performance.

Real-time implementation of the Karhunen-Loève expansion suffers from three major drawbacks: 1) A large amount of sampled data and computational effort is required for estimating the direction cosines of the Karhunen-Loève basis vectors. 2) The task of projecting an N-component incoming signal vector into its principal coordinates requires $N^2$ computer multiplications. 3) A new N x N matrix must be calculated for each change in the statistical properties of the environment.

The advantage of other transform techniques (e.g., Fourier, Cosine, Walsh) lies in the fact that they possess fast computation algorithms and that they employ a fixed set of unitary matrices independent of the signal statistics. Their performances, however, should be judged by the degree to which each transform approximates the Karhunen-Loève expansion in decorrelating the signals.

A good measure of the degree of correlation still remaining after the application of a specific transform is given by the norm of the matrix containing the off-diagonal covariance elements of the transformed coefficients. This norm was shown to control the performance degradation resulting from residual correlation in both coding and filtering. When this norm is small one is assured that the performance degradation will remain below a tolerated level and, therefore, the behavior of this norm for large N governs the proper selection of the signal dimension N. When the norm generated by a specific transform vanishes at large N we say that that transform is asymptotically equivalent to the KLT. The rate at which the norm vanishes, however, determines the quality ranking among several contending transforms.


Consider a continuous signal which is sampled at N time points to give an N-dimensional vector $x$. Let $C_N$ be an N-dimensional unitary transform matrix and $y \triangleq C_N x$ the transformed signal. If $x$ is considered a random vector with an autocovariance matrix $T_N$ then the transformed autocovariance matrix is $C_N T_N C_N^*$ ($C_N^*$ is the adjoint matrix of $C_N$). The norm determining the degree of decorrelation achieved by $C_N$ is given by

$$\left| C_N T_N C_N^* - \mathrm{DIAG}(C_N T_N C_N^*) \right|^2 = \frac{1}{N} \sum_{i \neq j} \left| (C_N T_N C_N^*)_{ij} \right|^2 .$$

Now consider a sequence of such signal vectors with increasing dimension and the corresponding sequence of autocovariance matrices $\{T_N\}_{N=1}^{\infty}$ [N will be called the block size]. We may transform each $T_N$ using a transform of corresponding dimension $C_N$ taken from a transform sequence $\{C_N\}_{N=1}^{\infty}$. We want to examine the degree of diagonalization of the transformed sequence $\{C_N T_N C_N^*\}$ as the block size grows to infinity.

If the signal covariance matrices were known, it would be possible to compute (numerically) the value of $|C_N T_N C_N^* - \mathrm{DIAG}(C_N T_N C_N^*)|$ for each N, and to observe its behavior or verify asymptotic equivalence. In most cases, however, a system of transforms must be decided upon with only partial knowledge of the input statistics. Techniques using the discrete Fourier transform (DFT) or the discrete cosine transform (DCT) [8], for instance, are often expected to process stationary signals of arbitrary statistics. Analytical techniques must be employed for examining the asymptotic behavior of $C_N T_N C_N^*$ for an entire class of covariance matrices $T_N$. A basic difficulty impeding such analysis is that each element of $C_N T_N C_N^*$ consists of sums of components, where both the component values and the size of the sum change with N. The expressions obtained for those sums are formidable, which makes it almost impossible to derive analytic results regarding the asymptotic behavior of a given transform.
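When the covariance is known, the numerical route described above is straightforward. The sketch below illustrates it for one assumed case, a Markov-1 covariance with $\rho = 0.5$ and an orthonormal DCT; the choice of signal model, block sizes, and function names are assumptions made for illustration. It tracks the off-diagonal weak norm as the block size grows and compares $N$ times its square with the value $2\rho^2/(1-\rho^2)^2$.

```python
import numpy as np

def markov1_cov(N, rho):
    idx = np.arange(N)
    return rho ** np.abs(idx[:, None] - idx[None, :])

def orthonormal_dct(N):
    j = np.arange(N)
    k = np.arange(N)[:, None]
    C = np.sqrt(2.0 / N) * np.cos((2 * j + 1) * k * np.pi / (2 * N))
    C[0, :] = 1.0 / np.sqrt(N)
    return C

def off_diagonal_weak_norm_sq(T, C):
    """Squared weak norm of C T C^* with its diagonal removed."""
    Tp = C @ T @ C.conj().T
    off = Tp - np.diag(np.diag(Tp))
    return np.sum(np.abs(off) ** 2) / T.shape[0]

rho = 0.5
residuals = {
    N: off_diagonal_weak_norm_sq(markov1_cov(N, rho), orthonormal_dct(N))
    for N in (16, 64, 256)
}
# If the squared norm decays like 1/N, N * residual should level off.
scaled = {N: N * r for N, r in residuals.items()}
limit = 2 * rho ** 2 / (1 - rho ** 2) ** 2
print(residuals, scaled, limit)
```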

This paper attempts to reduce this difficulty by deriving certain limit behavior of the elements of the transformed matrices. An analytical framework is developed which relates the problem at hand to known theories of analysis, thus making it possible to utilize known limit theorems of classical analysis to derive the limit behavior of transforms.

2. BASIC DEFINITIONS

Let $S_N$ be the space of symmetric and real N-dimensional matrices. On $S_N$ we use two norms, the weak norm and the strong norm. Let $A \in S_N$ be a matrix with eigenvalues $\{\lambda_i\}_{i=0}^{N-1}$. The weak norm of $A$ is defined by:

$$|A|^2 \triangleq \frac{1}{N} \sum_{i=0}^{N-1} \sum_{j=0}^{N-1} |a_{ij}|^2 = \frac{1}{N} \sum_{i=0}^{N-1} \lambda_i^2 .$$

The strong norm of $A$ is:

$$\|A\| \triangleq \sup \left\{ |\langle Au, u \rangle| : \|u\| = 1,\ u \in \mathbb{R}^N \right\} = \max_{0 \le i \le N-1} |\lambda_i|$$

[where $\langle \cdot , \cdot \rangle$ denotes an inner product on $\mathbb{R}^N$].

Both norms are invariant under unitary transforms. That is, if $C$ is unitary, $|A| = |CAC^*|$ and $\|A\| = \|CAC^*\|$. Moreover, $\|A\| \ge |A|$, which is where the norms acquire their names. The weak norm of the off-diagonal portion of the transformed covariance matrix (namely, $|CAC^* - \mathrm{DIAG}(CAC^*)|$) measures the degree of residual correlation between the transformed signal components and bounds practical performance degradation resulting from residual correlations.
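The two norms and their stated properties can be illustrated on a small example. This is a minimal sketch, assuming the weak norm carries the $1/N$ normalization and the strong norm is the largest eigenvalue magnitude; the random test matrix and the helper names are arbitrary choices.

```python
import numpy as np

def weak_norm(A):
    """Weak norm: sqrt of (1/N) * sum of squared entries."""
    return np.sqrt(np.sum(np.abs(A) ** 2) / A.shape[0])

def strong_norm(A):
    """Strong norm of a real symmetric matrix: largest |eigenvalue|."""
    return np.max(np.abs(np.linalg.eigvalsh(A)))

rng = np.random.default_rng(0)
N = 6
B = rng.standard_normal((N, N))
A = (B + B.T) / 2                                   # real symmetric test matrix
Q, _ = np.linalg.qr(rng.standard_normal((N, N)))    # a random orthogonal matrix
A_rot = Q @ A @ Q.T

# Residual correlation measure: weak norm of the off-diagonal part.
off_diag_residual = weak_norm(A_rot - np.diag(np.diag(A_rot)))
print(weak_norm(A), weak_norm(A_rot), strong_norm(A), strong_norm(A_rot))
print(off_diag_residual)
```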

To be able to consider sequences of matrices, such as the autocovariance matrices of signals with increasing block size, we define nets. A net is a strongly bounded sequence of matrices $\{A_N\}_{N=1}^{\infty}$, such that for every N, $A_N$ is an N x N matrix. Two nets, $a = \{A_N\}$ and $b = \{B_N\}$, are called equivalent if

$$\lim_{N \to \infty} |A_N - B_N| = 0 .$$

A matrix class $\mathcal{A}$ is a collection of nets. The N-section of $\mathcal{A}$ is the collection of N-dimensional matrices which belong to nets in $\mathcal{A}$. We denote the N-section of $\mathcal{A}$ by $\mathcal{A}_N$.

Let $\mathcal{D}$ be the matrix class of all nets of positive diagonal matrices. $\mathcal{D}$ is called the diagonal class.

Let $\gamma = \{C_N\}$ be a net of unitary transform matrices and $a = \{A_N\}$ an arbitrary net. Then by $\gamma a \gamma^*$ we mean the transformed net $\{C_N A_N C_N^*\}$. If $\mathcal{A}$ is a matrix class we denote by $\gamma \mathcal{A} \gamma^*$ the transformed class. $\gamma \mathcal{A} \gamma^*$ we call the $\gamma$-spectral representation of the class $\mathcal{A}$. $\gamma^* \mathcal{D} \gamma$ is called the class diagonal in $\gamma$. As an example, the class diagonal in the net of Fourier transforms is the class of circulant matrices.

Given a transform net $\gamma$ we consider the performance of $\gamma$ on a class of covariance matrices $\mathcal{T}$, called the signal class. $\gamma$ performs well on $\mathcal{T}$, in the sense that it approximately diagonalizes the class $\mathcal{T}$, if every net $\gamma t \gamma^*$ in $\gamma \mathcal{T} \gamma^*$ is asymptotically equivalent to a diagonal net. This motivates the following definition: if $\mathcal{A}$ and $\mathcal{B}$ are matrix classes, $\mathcal{A}$ is said to be an asymptotic cover of $\mathcal{B}$ if for any net $b \in \mathcal{B}$ there is a net $a \in \mathcal{A}$ such that $a$ and $b$ are asymptotically equivalent. We use the notation $\mathcal{A} \Rightarrow \mathcal{B}$ to denote this fact. For example, the class of circulant matrices is known to cover the class of Toeplitz matrices. Similarly, the class of Markov-1 Toeplitz matrices has been shown to be covered by the class diagonal in the cosine transform [9].

The problem of evaluating the asymptotic performance of a transform technique is formulated as follows: For a given transform net $\gamma$, does the diagonal class $\mathcal{D}$ asymptotically cover $\gamma \mathcal{T} \gamma^*$? From the definition of asymptotic equivalence of nets, and the invariance of the weak norm under unitary transformations, we have: $\mathcal{D} \Rightarrow \gamma \mathcal{T} \gamma^*$ if and only if $\gamma^* \mathcal{D} \gamma \Rightarrow \mathcal{T}$.

That is, our problem is to determine whether the signal class is asymptotically covered by the class diagonal in $\gamma$.

Remark: These definitions and this approach are a slight modification of those presented in (Pearl [10]).

We also use the notation $\mathcal{A} \Leftrightarrow \mathcal{B}$ if both $\mathcal{A}$ covers $\mathcal{B}$ and $\mathcal{B}$ covers $\mathcal{A}$. We say then that $\mathcal{A}$ and $\mathcal{B}$ are asymptotically equivalent.

3.  DISCRETE  GENERATION  OF  TRANSFORMS  AND  THEIR  ASYMPTOTIC  PROPERTIES 


3.1 Numerical Quadrature and Transform Generation

Numerical quadrature is a scheme for approximating integrals by finite sums. It is discussed in any book on numerical analysis and to a deeper extent in (Krylov [11]). Our approach is to study the asymptotic behavior of transform nets using integral approximations of finite sums. A brief summary of numerical quadrature is given in the sequel.

Let X be a real interval (usually we shall take X = [-1,1] or [0,1]). Let $d\mu(x)$ be a measure on X. Consider integrals of the form $\int_X f(x)\,d\mu(x)$. A numerical quadrature is a sum of the form

$$Q_N(f) = \sum_{i=0}^{N-1} \lambda_i^N f(x_i^N) .$$

$\{\lambda_i^N\}_{i=0}^{N-1}$ are called the weights of the numerical quadrature and $\{x_i^N\}_{i=0}^{N-1}$ the discretization points. N is the order of the numerical quadrature. A quadrature is exact of order N if for any polynomial $P(x)$ of order $\le N$, $Q_N(P) = \int_X P(x)\,d\mu(x)$. A quadrature converges w.r.t. a given class of functions $F$ if for any $f \in F$, $Q_N(f) \to \int_X f(x)\,d\mu(x)$.


Consider the case where $d\mu(x)$ is a probability measure over X = [-1,1]. To demonstrate the process of generating unitary transforms by discretization, we present the Gauss-Jacobi quadrature formula. Let $\{P_N(x)\}$ be the orthonormal polynomials w.r.t. $d\mu(x)$ obtained by the application of the Gram-Schmidt process to $x^k$, $k = 0, 1, \ldots, N$. It is known (Szego [12]) that $P_N(x)$ is a polynomial of order N which possesses N distinct roots in (-1,1): $\{x_i^N\}_{i=0}^{N-1}$. The Gauss-Jacobi theorem states that there are numbers (Christoffel numbers) $\{\lambda_i^N\}_{i=0}^{N-1}$ such that

(1) $\lambda_i^N > 0$ for $i = 0, \ldots, N-1$, and $\sum_{i=0}^{N-1} \lambda_i^N = 1$.

(2) The Gauss-Jacobi quadrature $Q_N(f) \triangleq \sum_{i=0}^{N-1} \lambda_i^N f(x_i^N)$ is exact of order 2N-1.

(3) $Q_N(f)$ converges for any $f$ for which $\int_X f(x)\,d\mu(x)$ converges.


This theorem implies the following procedure for generating orthogonal transform nets. The polynomial $P_k \cdot P_j$, $0 \le k, j \le N-1$, is of degree $\le 2N-1$, thus

$$Q_N(P_k \cdot P_j) = \int_{-1}^{1} P_k(x) P_j(x)\,d\mu(x) = \delta_{kj},$$

therefore:

$$\sum_{i=0}^{N-1} \lambda_i^N\, P_k(x_i^N)\, P_j(x_i^N) = \delta_{kj} .$$

If we let $[C_N]_{ki} \triangleq \sqrt{\lambda_i^N}\, P_k(x_i^N)$ then $\{C_N\}$ is a unitary transform net. We call transforms generated by this procedure Gauss-Jacobi Transforms.

Examples: (1) Let X = [-1,1], $d\mu(x) = \dfrac{dx}{\pi\sqrt{1-x^2}}$; then $P_n(x) = \cos n\theta$ (where $x = \cos\theta$) are, up to a constant normalization factor, the corresponding orthonormal polynomials. The G-J transform is formed by discretizing $P_0, \ldots, P_{N-1}$ at the zeros of $P_N$. This gives the cosine transform as discussed in (Ahmed [8]).
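The construction in Example (1) can be reproduced with a standard Gauss-Chebyshev rule. The sketch below assumes the probability measure $d\mu = dx/(\pi\sqrt{1-x^2})$, orthonormal polynomials $P_0 = 1$ and $P_k = \sqrt{2}\cos k\theta$ for $k \ge 1$, and the matrix $[C_N]_{ki} = \sqrt{\lambda_i}\,P_k(x_i)$; it uses NumPy's `chebgauss` for the nodes and weights, and the function names are illustrative.

```python
import numpy as np
from numpy.polynomial.chebyshev import chebgauss

def gauss_jacobi_chebyshev_transform(N):
    """C[k, i] = sqrt(lambda_i) * P_k(x_i) at the zeros of the N-th Chebyshev polynomial."""
    x, w = chebgauss(N)                  # nodes and weights for weight 1/sqrt(1 - x^2)
    lam = w / np.pi                      # Christoffel numbers of the probability measure
    theta = np.sort(np.arccos(x))        # node angles in increasing order
    k = np.arange(N)[:, None]
    P = np.sqrt(2.0) * np.cos(k * theta) # orthonormal P_k for k >= 1
    P[0, :] = 1.0                        # P_0 = 1
    return np.sqrt(lam) * P              # all weights are equal, so the sort is harmless

def reference_dct(N):
    """Orthonormal DCT written out directly, for comparison."""
    j = np.arange(N)
    k = np.arange(N)[:, None]
    C = np.sqrt(2.0 / N) * np.cos((2 * j + 1) * k * np.pi / (2 * N))
    C[0, :] = 1.0 / np.sqrt(N)
    return C

N = 8
C = gauss_jacobi_chebyshev_transform(N)
unitarity_error = np.abs(C @ C.T - np.eye(N)).max()
dct_gap = np.abs(C - reference_dct(N)).max()
print(unitarity_error, dct_gap)
```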


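(2) Let X = [-1,1], $d\mu(x) = \frac{2}{\pi}\sqrt{1-x^2}\,dx$. Then $P_n(x) = \dfrac{\sin (n+1)\theta}{\sin\theta}$, where $x = \cos\theta$, are the Tchebychev polynomials of the second kind and give rise to the sine transform

$$[S_N]_{ki} = a_N \sin\frac{(k+1)(i+1)\pi}{N+1},$$

where $a_N$ are normalization factors.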


If X = [-π, π] and if we consider integrals of the form $\frac{1}{2\pi}\int_{-\pi}^{\pi} f(e^{i\theta})\,d\theta$ then we have similar results for the Newton-Cotes quadrature scheme. The discretization points in this case are equally spaced in [0, 2π): $\theta_i^N = 2\pi i/N$, $i = 0, \ldots, N-1$, and the weights are selected so that the resulting quadrature is exact of order N-1. If we take the system $\{e^{in\theta}\}$ of orthonormal functions then the exactness of the quadrature scheme gives the familiar Discrete Fourier Transform.

Using arbitrary measures on the unit circle and their corresponding orthonormal trigonometric polynomials one can produce many other trigonometric transforms.

3.2 Termwise Asymptotic Behavior of Transforms - Generalized Toeplitz Matrices

Let X = [-1, 1], $d\mu(x)$ a probability measure on X, and $P_k(x)$ the corresponding orthonormal polynomials. If $Q_N(f)$ is the Gauss-Jacobi quadrature scheme, then by the G-J theorem, this scheme converges for every function $f$ such that $\int_X f(x)\,d\mu(x)$ exists. In particular, if $f(x)$ is a bounded positive function then

$$Q_N(P_k P_j f) \to \int_X P_k(x) P_j(x) f(x)\,d\mu(x) \quad \text{as } N \to \infty .$$

If $D_N = \mathrm{Diagonal}[f(x_0^N), \ldots, f(x_{N-1}^N)]$ and $C_N$ is the Gauss-Jacobi transform corresponding to $\{P_k\}$ then this simply implies that

$$[C_N D_N C_N^*]_{kj} \to \int_X P_k(x) P_j(x) f(x)\,d\mu(x) \quad \text{as } N \to \infty .$$

That is, if we look at the diagonal net $d \triangleq \{D_N\}$ and the G-J transform net $\gamma \triangleq \{C_N\}$ then the elements of the matrices of the transformed net $\gamma d \gamma^*$ tend asymptotically, termwise, to the elements of the fixed (infinite) matrix

$$[M(f)]_{kj} = \int_X P_k(x) P_j(x) f(x)\,d\mu(x) .$$

This means that the class diagonal in $\gamma$ is termwise asymptotically equivalent to the class of finite sections of infinite matrices of the form $M(f)$.

This motivates a further examination of such classes. Given an orthonormal system $\{\phi_k(x)\}$ on a measure space (X, dμ) consider the infinite matrix defined by

$$[M(f)]_{kj} \triangleq \int_X f(x)\,\phi_k(x)\,\phi_j(x)\,d\mu(x)$$

where $f$ is a positive bounded function. With each such matrix we associate a net of finite sections of $M(f)$ ($k, j = 0, 1, \ldots, N-1$); we use $m_f$ to denote this net. The class of all such nets is called the Generalized Toeplitz class associated with $\{\phi_k\}$.

The class diagonal in $\gamma$ is termwise asymptotically equivalent to the generalized Toeplitz class associated with $\{P_k(x)\}$.

Example 1: The generalized Toeplitz matrices associated with $\phi_k(x) \triangleq \dfrac{e^{ikx}}{\sqrt{2\pi}}$ are the ordinary Toeplitz matrices since:

$$[M(f)]_{kj} = \frac{1}{2\pi} \int_{-\pi}^{\pi} f(x)\, e^{i(k-j)x}\,dx = \hat{f}(k-j)$$

($\hat{f}(k-j)$ is the $(k-j)$th Fourier coefficient of $f$). That is, $M(f)$ has constant diagonals.

From here on we reserve the term Toeplitz matrix for matrices $T = [t_{ij}]$ of the form $t_{ij} = t(|i-j|)$ which are positive definite. Such matrices are the autocovariance matrices of stationary signals.

The previous result implies that the class diagonal in the DFT (circulant matrices) is termwise asymptotically equivalent to the Toeplitz class. This parallels the result of Hamidi and Pearl [13] showing the vanishing of the off-diagonal elements of $\gamma t \gamma^*$, with $\gamma$ being the net of DFT matrices.

Example 2: Consider again the DCT which has been shown to be the G-J transform associated with the Chebychev polynomials of the first kind $P_n(x) = \cos n\theta$, $\theta = \cos^{-1} x$. We now find the generalized Toeplitz matrices associated with this orthogonal system. Let $f(x)$ be a positive bounded function on X = [-1, 1]. Then, writing $f(\theta)$ for $f(\cos\theta)$,

$$[M(f)]_{kj} = \frac{1}{\pi}\int_{-1}^{1} f(x) P_k(x) P_j(x) \frac{dx}{\sqrt{1-x^2}} = \frac{1}{\pi}\int_0^{\pi} f(\theta) \cos k\theta \cos j\theta \,d\theta = \frac{1}{2\pi}\int_0^{\pi} f(\theta) \cos (k-j)\theta \,d\theta + \frac{1}{2\pi}\int_0^{\pi} f(\theta) \cos (k+j)\theta \,d\theta .$$

Thus, a typical $M(f)$ is the sum of a Toeplitz matrix and a Hankel matrix (both infinite), with the same parameters. Loosely stated, our theorem above says that any cosine-diagonal net tends termwise to such a generalized cosine-Toeplitz matrix. If we can prove that:

(1) The generalized cosine-Toeplitz class is an asymptotic cover of the Toeplitz class

and (2) Termwise asymptotic behavior implies ordinary asymptotic equivalence (in weak norm sense),

then we could show that the cosine diagonal class covers the Toeplitz class.

In summary, starting with a measure space (X, dμ) and an orthonormal system, we followed two paths to produce two matrix classes: $\gamma \mathcal{D} \gamma^*$, the class diagonal in a corresponding transform net, and $m_f$, a generalized Toeplitz class. Termwise, these classes approximate each other asymptotically. Faced with the problem of showing $\gamma \mathcal{D} \gamma^* \Rightarrow \mathcal{T}$ we prefer to show

(1) $\gamma \mathcal{D} \gamma^* \Leftrightarrow m_f$

(2) $m_f \Leftrightarrow \mathcal{T}$

This approach is motivated by the fact that $m_f$ is a more manageable matrix form than $\gamma \mathcal{D} \gamma^*$ because the elements of any net in $m_f$ do not vary with N. Consequently the asymptotic behavior of the nets in $m_f$ is more transparent.

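3.3 From Termwise To Weak Equivalence

Theorem 1: Let X = [-1, 1], $d\mu(x)$ a probability measure on X which has a density, $\{P_k(x)\}$ the orthogonal polynomials corresponding to $d\mu(x)$, and $f$ a bounded positive function. Define

$$\alpha_{ij}^N \triangleq [C_N D_N C_N^*]_{ij} \quad \text{(as in 3.2)}$$

$$\alpha_{ij} \triangleq \int_X f(x) P_i(x) P_j(x)\,d\mu(x) .$$

Then

$$\lim_{N \to \infty} \frac{1}{N} \sum_{i,j=0}^{N-1} \left| \alpha_{ij}^N - \alpha_{ij} \right|^2 = 0 .$$

This means that $\gamma \mathcal{D} \gamma^*$ is asymptotically equivalent to $m_f$.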

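Proof: By 3.2 we have termwise convergence, i.e. $\alpha_{ij}^N \to \alpha_{ij}$ as $N \to \infty$. Also, from [13] (Theorem 6.2):

$$\lim_{N \to \infty} \frac{1}{N} \sum_{i,j=0}^{N-1} (\alpha_{ij}^N)^2 = \lim_{N \to \infty} \frac{1}{N} \sum_{i,j=0}^{N-1} \alpha_{ij}^2 = \int_{-1}^{1} f^2(x)\,d\mu(x) .$$

Now

$$\frac{1}{N} \sum_{i,j=0}^{N-1} \left| \alpha_{ij}^N - \alpha_{ij} \right|^2 = \frac{1}{N} \sum_{i,j=0}^{N-1} \left[ (\alpha_{ij}^N)^2 + \alpha_{ij}^2 \right] - \frac{2}{N} \sum_{i,j=0}^{N-1} \alpha_{ij}^N \alpha_{ij} .$$

Let $N \to \infty$; then the first sum goes to $2\int_{-1}^{1} f^2(x)\,d\mu(x)$. Now

$$\left| \frac{1}{N} \sum_{i,j=0}^{N-1} \alpha_{ij}^N \alpha_{ij} - \frac{1}{N} \sum_{i,j=0}^{N-1} \alpha_{ij}^2 \right| \le \frac{1}{N} \sum_{i,j=0}^{N-1} |\alpha_{ij}| \left| \alpha_{ij}^N - \alpha_{ij} \right|$$

and (as generalized Fourier coefficient of $f$) $\lim_{i,j \to \infty} \alpha_{ij} = 0$.

Let $\epsilon > 0$ and let $N_1$ be such that $|\alpha_{ij}| < \epsilon$ for $i, j \ge N_1 - 1$. If we choose $N_0 > N_1$ large enough such that $|\alpha_{ij}^{N_0} - \alpha_{ij}| < \epsilon$ for $i, j \le N_1 - 1$ we get

$$\frac{1}{N_0} \sum_{i,j=0}^{N_0-1} |\alpha_{ij}| \left| \alpha_{ij}^{N_0} - \alpha_{ij} \right| \le \epsilon\, \frac{1}{N_0} \sum_{i,j=0}^{N_1-1} |\alpha_{ij}| + \epsilon\, \frac{1}{N_0} \sum_{i,j=N_1-1}^{N_0-1} \left| \alpha_{ij}^{N_0} - \alpha_{ij} \right| .$$

These two sums are bounded because $\frac{1}{N} \sum_{i,j=0}^{N-1} \alpha_{ij}^2$ tends to $\int_{-1}^{1} f^2(x)\,d\mu(x)$ ([13] (Theorem 6.2)) and $\alpha_{ij}^N \to 0$ as $i, j \to \infty$. Thus

$$\left| \frac{1}{N} \sum_{i,j=0}^{N-1} \alpha_{ij}^N \alpha_{ij} - \frac{1}{N} \sum_{i,j=0}^{N-1} \alpha_{ij}^2 \right| \le \epsilon \cdot \text{constant}$$

which proves that

$$\frac{2}{N} \sum_{i,j=0}^{N-1} \alpha_{ij}^N \alpha_{ij} \to 2 \int_{-1}^{1} f^2(x)\,d\mu(x) .$$

Thus

$$\frac{1}{N} \sum_{i,j=0}^{N-1} \left| \alpha_{ij}^N - \alpha_{ij} \right|^2 \to 0 \quad \text{as } N \to \infty .$$

Q.E.D.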


SUMMARY:

We have shown that for measures with density $\gamma \mathcal{D} \gamma^* \Leftrightarrow m_f$. This means that to find out whether $\gamma^* \mathcal{D} \gamma \Rightarrow \mathcal{T}$ it is enough

(a) To find the form of generalized Toeplitz matrices corresponding to $d\mu(x)$,

(b) To compare those infinite matrices with the signal class $\mathcal{T}$, and prove their equivalence.

3.4 Applications

1. We first use this method to demonstrate that the cosine-diagonal class is an asymptotic cover of a useful Toeplitz subclass. Consider the class $m_f$ with $\gamma$ being the DCT net. As shown before, a typical net is of the form

$$[M(f)]_{kj} = \alpha_{|k-j|} + \alpha_{k+j}, \qquad \alpha_n = \frac{1}{2\pi}\int_0^{\pi} f(\theta)\cos n\theta\,d\theta .$$

We want to show that the Toeplitz part is asymptotically equivalent to the Toeplitz plus Hankel net. This means that the Hankel net must tend (in weak norm) to zero, i.e., we want to show

$$\lim_{N \to \infty} \frac{1}{N} \left[ \alpha_0^2 + 2\alpha_1^2 + 3\alpha_2^2 + \cdots + \alpha_{2N-2}^2 \right] = 0 .$$

Clearly, if we restrict our Toeplitz class to satisfy the additional smoothness condition $\sum_{m=0}^{\infty} m\,\alpha_m^2 < \infty$ then the limit above approaches 0 like $N^{-1}$.

We conclude, therefore, that the class of stationary signals satisfying $\sum_{m=0}^{\infty} m\,\alpha_m^2 < \infty$ is asymptotically covered by the cosine transform. This is a substantial generalization of Hamidi and Pearl's [9] result. It implies, for instance, that the DCT is asymptotically optimal for all finite-order Markov signals. Moreover, the proof of this statement is surprisingly simple, involving none of the tedious calculations used in [9] for Markov-1 processes.
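The vanishing of the Hankel part is simple to observe numerically. The sketch below assumes, for illustration only, the Markov-1 sequence $\alpha_m = \rho^m$, which satisfies the smoothness condition $\sum m\,\alpha_m^2 < \infty$; it evaluates the squared weak norm of the N x N Hankel section with entries $\alpha_{k+j}$ and checks that $N$ times this quantity approaches $\sum_m (m+1)\rho^{2m} = 1/(1-\rho^2)^2$.

```python
import numpy as np

def hankel_weak_norm_sq(alpha, N):
    """(1/N) * sum over k, j < N of alpha[k + j]**2."""
    i = np.arange(N)
    H = alpha[i[:, None] + i[None, :]]       # Hankel section with entries alpha_{k+j}
    return np.sum(H ** 2) / N

rho = 0.9
alpha = rho ** np.arange(2 * 512)            # alpha_m = rho**m, long enough for N = 512
norms = {N: hankel_weak_norm_sq(alpha, N) for N in (8, 32, 128, 512)}

# If the squared weak norm decays like 1/N, N * norm should approach a constant.
scaled = {N: N * v for N, v in norms.items()}
expected_constant = 1.0 / (1.0 - rho ** 2) ** 2
print(norms, scaled, expected_constant)
```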

2. The same procedure can be applied to polynomials on the unit circle (trigonometric polynomials) which yield the DFT by discretization. In this case the equivalence $\gamma \mathcal{D} \gamma^* \Leftrightarrow m_f$ has a particular meaning; the left hand side is the class of circulant matrices and the right hand side is just the ordinary Toeplitz class. Thus we readily obtain the result that the circulant class is asymptotically equivalent to the Toeplitz class.

CONCLUSIONS 

A method for studying the asymptotic behavior of spectral transformations was developed using numerical quadrature theory. Using this method, the asymptotic optimality of common unitary transforms can be tested conveniently. A practical new result obtained by this method states that the discrete cosine transform is asymptotically equivalent to the Karhunen-Loève transform of stationary signals satisfying $\sum_{m=0}^{\infty} m\,\alpha_m^2 < \infty$ (e.g. finite order Markov processes).


REFERENCES 


1. H. C. Andrews and W. K. Pratt, "Transform image coding," in Polytech. Inst. Brooklyn Symp. Computer Processing in Communications, Apr. 1969, pp. 63-84.

2. W. K. Pratt and H. C. Andrews, "Application of Fourier-Hadamard transformation to bandwidth compression," in M.I.T. Symp. Picture Bandwidth Compression, Apr. 1969.

3. A. Habibi and P. A. Wintz, "Image coding by linear transformation and block quantization," IEEE Trans. Commun. Technol., vol. COM-19, pp. 50-62, Feb. 1971.

4. S. J. Campanella and G. S. Robinson, "Digital sequency decomposition of voice signals," Proceedings of the Symposium on the Applications of Walsh Functions, Washington, D. C., March 1970, pp. 230.

5. R. W. Means, H. J. Whitehouse, and J. M. Speiser, "Television encoding using a hybrid discrete cosine transform and a differential pulse code modulator in real time," Proceedings of the IEEE National Telecommunication Conference, December 2, 1974, San Diego, California.

6. J. Pearl, "Basis-restricted transformations and performance measures for spectral representations," IEEE Trans. Inform. Theory, vol. IT-17, pp. 751-752, November 1971.

7. J. Pearl, "On coding and filtering stationary signals by discrete Fourier transforms," IEEE Trans. Information Theory, vol. IT-19, No. 2, pp. 229-232, March 1973.

8. N. Ahmed, T. Natarajan, and K. R. Rao, "Discrete cosine transform," IEEE Trans. Comput., vol. C-23, pp. 90-93, January 1974.

9. M. Hamidi and J. Pearl, "A comparison of Fourier and cosine transforms of Markov-1 signals," UCLA-ENG-REPORT-7565, November 1975. Submitted to IEEE Trans. on Acoustics, Speech and Signal Processing.

10. J. Pearl, "Asymptotic equivalence of spectral representations," to be published in IEEE Trans. on Acoustics, Speech, and Signal Processing, December 1975.

11. V. I. Krylov, Approximate Calculation of Integrals, New York, Macmillan, 1962.

12. G. Szego, Orthogonal Polynomials, New York, American Mathematical Society, 1959.

13. U. Grenander and G. Szego, Toeplitz Forms and Their Applications, University of California Press, Berkeley and Los Angeles, 1958.

14. M. Hamidi and J. Pearl, "On the residual correlation of finite-dimensional discrete Fourier transforms of stationary signals," IEEE Trans. on Information Theory, vol. IT-21, pp. 480-482, July 1975.


APPENDIX I


REAL  TIME  TELEVISION  IMAGE  BANDWIDTH  REDUCTION 
USING  CHARGE  TRANSFER  DEVICES* 

H. J. Whitehouse
R. W. Means
E. H. Wrench


Abstract 


Recent advances in analog semiconductor technology have made possible the direct sensing and processing of television images. By
combining a charge transfer device (CTD) imager and a CTD transversal filter, real time image sensing and encoding have been achieved
with low power integrated circuits so that digital transmission and bit rate reduction are made possible using differential pulse code
modulation (DPCM). Good mean square error performance and freedom from DPCM artifacts are possible in a hybrid intraframe image
encoder. The hybrid transform encoder performs a discrete cosine transform (DCT) on each line of the television image as it is scanned.
This compacts the variance into low frequency coefficients and the DPCM encodes the corresponding DCT coefficients between successive
lines. Computer simulation of this hybrid coding technique has shown good performance on 256 x 256 pixel images at 0.5 bits/pixel
and  channel  bit  error  rates  of  I0~^.  An  experimental  system  using  a  low  resolution  General  Electric  100  x  100  charge  injection  device 
camera  and  a  Texas  Instruments  bucket  brigade  transversal  filter  as  part  of  the  DCT  processor  has  been  constructed  and  provides  good 
low  resolution  image  quality  at  I  bit/pixel  and  bit  error  rates  of  10'-^  A  high  resolution  vidicon  compatible  system  is  also  being 
constructed. 


Introduction 

Unitary transforms for image encoding have been used for intraframe encoding. In addition, these techniques may also be applied to interframe and multispectral encoding. However, all unitary transformations are information preserving and no bandwidth reduction results from the application of the transform to the image. Instead, the transforms redistribute the variance associated with each picture element (pixel) so that, subsequent to the transform, basis restricting operations on the transform coefficients will result in bandwidth reduction. Upon reconstruction of the original image from the basis restricted transform coefficients, a degraded version of the original image can be obtained. Unfortunately, the interrelationship between the type of transform, the form of the noninvertible operation, and the type of degradation in the reconstructed image is very complicated and subjective. The universally used analytic criterion of the mean-square-error is, at present, the best compromise technique for transform comparison.

For the particular operation of basis restriction by truncation, a particularly simple interpretation of the bandwidth reduction can be made. The transforms may be viewed as a variance redistributing operation that approximately decorrelates the transform coefficients while transforming the variance associated with each picture element into the low-order coefficients of the transform. Under the assumption that each set of picture elements can be considered as a sample function from a wide sense stationary random process with correlation function $r^{|\tau|}$, there exists an optimum discrete transformation, the Karhunen-Loeve transformation, which totally decorrelates the transform coefficients and maximally compacts the variance to the low-order coefficients. All other transformations can be compared in their performance by comparing their transform coefficient decorrelation and variance compaction with this optimum transformation.

This intuitive interpretation can be made rigorous through the use of the rate-distortion criterion. It has been found from experience that the closer the eigenvectors of the transformation are to the eigenvectors of the optimum Karhunen-Loeve transformation, the greater the variance is compacted and the more the coefficients can be truncated while maintaining a fixed rate distortion or mean-square-error.
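The variance compaction argument above can be illustrated numerically. The following is a minimal sketch, not the authors' simulation: it assumes a first-order Markov covariance with entries r^|i-j|, and the block size 16, correlation 0.95, and the choice of keeping 4 coefficients are illustrative values chosen here, not figures from the paper. The function names are hypothetical.

```python
import numpy as np

def markov_covariance(n, r):
    """Covariance matrix with entries r**|i-j| (first-order Markov model)."""
    idx = np.arange(n)
    return r ** np.abs(idx[:, None] - idx[None, :])

def dct_matrix(n):
    """Orthonormal even DCT matrix; row k is the k-th cosine basis vector."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    c[0, :] /= np.sqrt(2.0)
    return c

def compaction(n=16, r=0.95, keep=4):
    """Fraction of total variance carried by the first `keep` coefficients."""
    cov = markov_covariance(n, r)
    # Karhunen-Loeve: coefficient variances are the eigenvalues, largest first.
    klt_var = np.sort(np.linalg.eigvalsh(cov))[::-1]
    # DCT: coefficient variances are the diagonal of C cov C^T.
    c = dct_matrix(n)
    dct_var = np.diag(c @ cov @ c.T)
    total = np.trace(cov)
    return klt_var[:keep].sum() / total, dct_var[:keep].sum() / total

klt_frac, dct_frac = compaction()
print(f"KLT keeps {klt_frac:.4f}, DCT keeps {dct_frac:.4f} of the variance")
```

The Karhunen-Loeve fraction is an upper bound for any orthonormal transform, so the gap between the two printed numbers is one simple measure of how close a fixed transform comes to the optimum under this model.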


Transform  Encoding 


Karhunen-Loeve Transformation

If a continuous time function of zero mean and autocorrelation function $R(\tau)$ is considered to be a sample function from a wide-sense stationary random process, then this time function can be explicitly expanded by the Karhunen-Loeve expansion and the resulting coefficients will be uncorrelated. For a discrete function of zero mean and autocorrelation function $R(\tau) = r^{|\tau|}$, which may be considered as a sample function from a first-order Markov process, a similar discrete Karhunen-Loeve transformation may be defined. This transformation diagonalizes the covariance matrix and is optimal in the mean-square-error sense for a restricted set of basis functions that do not span the complete space.

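The discrete Karhunen-Loeve expansion is given for the case N = 2m as

$$G_k = \sum_{n=1}^{2m} \sqrt{\frac{2}{2m+\lambda_k}}\, \sin\left\{\omega_k\left[n - \frac{2m+1}{2}\right] + \frac{k\pi}{2}\right\} g_n, \qquad k = 1, 2, \ldots, 2m \qquad (1)$$

where

$$\lambda_k = \frac{1 - r^2}{1 - 2r\cos\omega_k + r^2} \qquad (2)$$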


*To be published in SPIE Conference Proceedings (Aug. 1975).
and $\omega_k$ are the first N positive roots of

$$\tan 2m\omega = \frac{-(1 - r^2)\sin\omega}{\cos\omega - 2r + r^2\cos\omega} \qquad (3)$$
Since the discrete Karhunen-Loeve expansion involves both the solution of a transcendental equation and the evaluation of the autocorrelation function of the data to be transformed, real time computation of this transform is quite complex. However, Habibi and Wintz have shown that Karhunen-Loeve transformations calculated using approximate autocorrelation functions are satisfactory for many applications.
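As a cross-check on the closed-form expressions, the eigenvalues of the covariance matrix r^|i-j| can be computed numerically and substituted back into the eigenvalue relation and the transcendental equation. This is a sketch under the assumption that the eigenvalue relation reads λ = (1 - r²)/(1 - 2r cos ω + r²) and that the roots satisfy tan Nω = -(1 - r²) sin ω / (cos ω - 2r + r² cos ω); the function name and the values N = 8, r = 0.95 are illustrative only.

```python
import numpy as np

def kl_residuals(n=8, r=0.95):
    """Return eigenvalues, implied frequencies, and the residual of the
    transcendental equation for the covariance matrix r**|i-j|."""
    idx = np.arange(n)
    cov = r ** np.abs(idx[:, None] - idx[None, :])
    lam = np.linalg.eigvalsh(cov)
    # Invert the eigenvalue relation to recover cos(omega) for each eigenvalue.
    cos_w = (1.0 + r * r - (1.0 - r * r) / lam) / (2.0 * r)
    w = np.arccos(np.clip(cos_w, -1.0, 1.0))
    # Transcendental equation, cross-multiplied to avoid the poles of tan():
    # sin(N w) * denom + (1 - r^2) * sin(w) * cos(N w) = 0
    denom = np.cos(w) - 2.0 * r + r * r * np.cos(w)
    resid = np.sin(n * w) * denom + (1.0 - r * r) * np.sin(w) * np.cos(n * w)
    return lam, w, resid

lam, w, resid = kl_residuals()
print("largest residual:", np.max(np.abs(resid)))
```

A residual near machine precision indicates that the numerically computed eigenvalues agree with the analytic characterization, which is what makes the transform expensive to evaluate in real time: the frequencies depend on r and must be found as roots.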

Discrete  Fourier  Transform 

Since the discrete Fourier transform is asymptotic to the Karhunen-Loeve transformation for the exponential covariance function and the basis vectors are picture independent, the Fourier transform represents a logical choice for real time implementation. The Fourier transform exists for all data lengths N. This is defined by

$$G_k = \sum_{n=0}^{N-1} e^{-i2\pi nk/N}\, g_n, \qquad k = 0, 1, \ldots, N-1 \qquad (4)$$

Discrete  Cosine  Transform 


Two different types of discrete cosine transform (DCT) are useful for reduced redundancy television image transmission. Both are obtained by extending the length N data block to have even symmetry, taking the discrete Fourier transform (DFT) of the extended data block, and saving N terms of the resulting DFT. Since the DFT of a real even sequence is a real even sequence, either DCT is its own inverse if a normalized DFT is used.

The "Odd DCT" (ODCT) extends the length N data block to length 2N - 1, with the middle point of the extended block as a center of even symmetry. The "Even DCT" (EDCT) extends the length N data block to length 2N, with a center of even symmetry located between the two points nearest the middle. For example, the odd length extension of the sequence A B C is C B A B C, and the even length extension is C B A A B C. In both cases, the symmetrization eliminates the jumps in the periodic extension of the data block which would occur if one edge of the data block had a high value and the other edge had a low value; in effect, it performs a sort of smoothing operation with no loss of information. It will be noted that the terms "odd" and "even" in ODCT and EDCT refer only to the length of the extended data block; in both cases the extended data block has even symmetry.
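The two symmetric extensions can be sketched directly with a general-purpose FFT. This is an illustration only, assuming unnormalized transforms and a half-sample phase factor exp(-iπk/2N) for the even case; the function names `odct` and `edct` are hypothetical, and NumPy's FFT stands in for whatever DFT hardware is used.

```python
import numpy as np

def odct(g):
    """Odd DCT: DFT of the length 2N-1 even extension (centre at g[0])."""
    g = np.asarray(g, dtype=float)
    # Periodic form of C B A B C starting at A:  A B C C B
    ext = np.concatenate([g, g[:0:-1]])
    return np.fft.fft(ext)[: len(g)].real

def edct(g):
    """Even DCT: DFT of the length 2N even extension, with a half-sample
    phase factor that makes the result real."""
    g = np.asarray(g, dtype=float)
    n = len(g)
    # Periodic form of C B A A B C starting at the second A:  A B C C B A
    ext = np.concatenate([g, g[::-1]])
    k = np.arange(n)
    return (np.exp(-1j * np.pi * k / (2 * n)) * np.fft.fft(ext)[:n]).real

g = np.array([3.0, 1.0, 4.0, 1.0, 5.0])
print(odct(g))
print(edct(g))
```

In both functions only the first N outputs are kept, matching the statement that N terms of the extended DFT are saved.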

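Let the data sequence be $g_0, g_1, \ldots, g_{N-1}$. The ODCT of g is defined as

$$G_k = \sum_{n=-(N-1)}^{N-1} g_n\, e^{-i2\pi nk/(2N-1)} \quad \text{for } k = 0, 1, \ldots, N-1 \qquad (5)$$

where

$$g_{-n} = g_n \quad \text{for } n = 0, 1, \ldots, N-1. \qquad (6)$$

By straightforward substitution it may be shown that

$$G_k = 2\,\mathrm{Re} \sum_{n=0}^{N-1} \hat{g}_n\, e^{-i2\pi nk/(2N-1)} \qquad (7)$$

where $\hat{g}$ is defined by equation (8).

$$\hat{g}_n = \begin{cases} 0.5\, g_0, & n = 0 \\ g_n, & n = 1, \ldots, N-1 \end{cases} \qquad (8)$$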

The EDCT of g is defined by equation (9), where the extended sequence is defined by equation (10).

$$G_k = e^{-i\pi k/(2N)} \sum_{n=-N}^{N-1} g_n\, e^{-i2\pi nk/(2N)} \quad \text{for } k = 0, 1, \ldots, N-1 \qquad (9)$$
If the mutually complex conjugate terms in equation (9) are combined, then equation (11) results. Equation (11) may be viewed as an alternate way of defining the EDCT.
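Equations (10) and (11) are not legible in this copy. The following is a reconstruction rather than a transcription: it assumes the extended sequence has its center of even symmetry between n = -1 and n = 0, as the verbal description of the EDCT implies, and that the EDCT is the length 2N DFT of that sequence multiplied by the phase factor exp(-iπk/2N).

```latex
\begin{align*}
g_{-1-n} &= g_n, \qquad n = 0, 1, \ldots, N-1
  && \text{(assumed extension)} \\
G_k &= e^{-i\pi k/(2N)} \sum_{n=0}^{N-1} g_n
       \left[ e^{-i2\pi nk/(2N)} + e^{+i2\pi (n+1)k/(2N)} \right] \\
    &= \sum_{n=0}^{N-1} g_n
       \left[ e^{-i\pi (2n+1)k/(2N)} + e^{+i\pi (2n+1)k/(2N)} \right] \\
    &= 2\,\mathrm{Re}\left\{ e^{-i\pi k/(2N)} \sum_{n=0}^{N-1} g_n\, e^{-i2\pi nk/(2N)} \right\}
     = 2 \sum_{n=0}^{N-1} g_n \cos\frac{\pi (2n+1) k}{2N}
\end{align*}
```

Each pair of terms in the second line is a complex conjugate pair once the phase factor is applied, which is why the result is real and why only the N original samples are needed.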
Ahmed has investigated the use of the EDCT as a substitute for the Karhunen-Loeve transform and finds that it is superior to the Fourier transform and is comparable to the Karhunen-Loeve (K-L) in rate-distortion performance while maintaining the computational simplicity of a transform which does not depend on the picture statistics. Habibi has shown by simulation that the DCT is equivalent in a mean-square-error sense to the K-L transform under basis restriction.


Hybrid  Transforms 

Mixed transforms can be used either for intraframe encoding or interframe encoding depending on the available memory and the type of transforms implemented. In order for a mixed transform to be competitive with one of the conventional two-dimensional transforms, it must offer either superior performance or simplicity of implementation. Of the transforms examined, the odd length cosine transform is competitive in performance since it can be implemented as the real part of a CZT and since the transform samples are real and are the samples of an autocorrelation function which may then be extrapolated by well-known techniques. In simulations of transform performance, the cosine transform has been shown to closely approximate the behavior of the Karhunen-Loeve transform.

The benefits of mixed transformation implementation and minimum memory may be achieved for digital transmission by combining a one- or two-dimensional unitary transform with generalized DPCM in a hybrid system. The basis of operation of the hybrid transform is that the unitary transform decorrelates the image within its constraints of transform type, dimensionality, and block size, while the generalized DPCM removes the correlation between transform blocks. This hybrid system is particularly attractive for remote sensor application since it has been found that its performance is approximately as good as the Karhunen-Loeve transform and its implementation requires minimum memory.


Implementation 


Computational  Modules 

A linear transform on sampled data of finite extent may be viewed as the multiplication of a vector by a matrix. Multiplication by diagonal, circulant, or Toeplitz matrices may be accomplished rapidly with simple computational hardware modules. Multiplication by an N x N diagonal matrix requires only a scalar multiplier and a memory containing N values to provide serial access to the reference function. Multiplication by an N x N Toeplitz matrix corresponds to a convolution and may be performed using a transversal filter having 2N-1 taps. Multiplication by an N x N circulant is a special case of multiplication by an N x N Toeplitz matrix in which the length of the transversal filter may be reduced to N taps if the data block is recirculated through the filter or reread into the filter from a buffer memory.
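The correspondence between these matrix products and transversal filtering can be checked in a few lines. This is a software sketch only, with `numpy.convolve` standing in for the transversal filter; the function names and the tap indexing convention (tap index equals row minus column, offset by N-1) are assumptions made here for illustration.

```python
import numpy as np

def toeplitz_times_vector(taps, x):
    """Multiply x by the N x N Toeplitz matrix T[m, n] = taps[(m - n) + N - 1]
    using a linear convolution with 2N-1 taps."""
    n = len(x)
    assert len(taps) == 2 * n - 1
    full = np.convolve(x, taps)            # length 3N - 2
    return full[n - 1 : 2 * n - 1]         # outputs aligned with rows m = 0..N-1

def circulant_times_vector(taps, x):
    """Multiply x by the N x N circulant C[m, n] = taps[(m - n) mod N] by
    passing the data block through an N-tap filter twice (recirculation)."""
    n = len(x)
    assert len(taps) == n
    full = np.convolve(np.concatenate([x, x]), taps)
    return full[n : 2 * n]

rng = np.random.default_rng(0)
x = rng.standard_normal(5)
taps_t = rng.standard_normal(9)
taps_c = rng.standard_normal(5)
print(toeplitz_times_vector(taps_t, x))
print(circulant_times_vector(taps_c, x))
```

The circulant case needs only N taps because the second pass of the data block supplies the wrapped-around terms that the longer Toeplitz filter would otherwise provide.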

One-Dimensional  DFT 


Linear filters have been used for many years for the calculation of the power spectra of continuous signals. One of the earliest methods used a bank of wave filters to measure the spectra in fractional octave bands for telephone network equalization. However, when increased resolution was required the number of filters rapidly became unmanageable. An alternative which overcame the difficulty of a large number of filters each with small time-bandwidth product was to substitute one linear FM (chirp) filter with large time-bandwidth product and to employ matched filtering. In this system the signal to be analyzed is used to single sideband (SSB) modulate a locally generated chirp signal and the composite modulated signal is filtered in a chirp delay line matched filter. Each component of the input signal spectrum shifts the locally generated chirp to a different position in the spectrum after SSB modulation and these shifted chirps then correlate with the reference signal represented as the impulse response of the matched filter at different times. Thus the output signal amplitude-time history reflects the amplitude-frequency composition of the input signal.

Bluestein recognized that the discrete Fourier transform (DFT) of sampled data was amenable to a similar interpretation. In addition to just calculating the magnitude of the Fourier transform, linear filters could calculate the phase and thus all of the operations such as cross convolution and cross correlation could be calculated. This technique came to be called the chirp-Z transform (CZT) and can be applied to other problems besides just the calculation of the DFT. Prior to these developments, digital computation of the DFT had been significantly improved by the use of a special algorithm called the fast Fourier transform (FFT) which was described by Cooley and Tukey. The FFT algorithm gained rapid popularity in signal processing since it allowed the calculation of the DFT to be done using significantly fewer machine operations (multiplications) than direct evaluation.

By direct inspection it is observed that, if symmetries of the function $\exp(j2\pi nm/N)$ are not exploited, then the number of complex multiplications required will be $N^2$, corresponding to N multiplications for each frequency component evaluated. Even on high speed digital computers this can become the limiting consideration in signal processing applications. The advantage of the FFT algorithm is that for highly composite values of the DFT size N the number of multiplications is proportional to $N \log_2 N$.

Although the FFT has been successful in substantially reducing the computing time and cost of using general purpose digital computers, it has several disadvantages for special purpose real time computation. At high throughput rates which are required for real time image processing the processor either must operate $\log_2 N$ times faster than the data rate or pipeline structures which use distributed memory and $\log_2 N$ multipliers must be used. In addition, the internal arithmetic of the FFT processor must be done at increased precision in order to compensate for the multiple round off errors introduced by the successive stages in the FFT processor. Although these difficulties can be overcome, it is not always possible to arrange the computation in a form where the size of the transform is highly composite. For the above reasons and because of the difficulty of obtaining small, low power, fast analog to digital converters, linear transversal filter implementations of the chirp-Z transform are attractive rather than the previous CZT implementation which used an FFT to perform the required convolution.

The DFT may be easily reduced to the form suitable for linear filtering by the substitution

$$2nm = n^2 + m^2 - (n - m)^2 \qquad (12)$$

which changes a product of variables into a difference so that

$$G_m = e^{-j\pi m^2/N} \sum_{n=0}^{N-1} e^{j\pi (n-m)^2/N}\, e^{-j\pi n^2/N}\, g_n \qquad (13)$$

This form is seen to be equivalent to factoring the Fourier matrix F into the product of three matrices

$$F = D\,T\,D \qquad (14)$$

where D is a diagonal matrix with elements $d_{nn} = \exp(-j\pi n^2/N)$ and T is a Toeplitz matrix with elements $t_{nm} = \exp(j\pi (n-m)^2/N)$.

The CZT algorithm is easily implemented by transversal filter techniques. In this case the DFT is computed by premultiplication by a discrete chirp, convolution with a discrete chirp, and postmultiplication by a discrete chirp. Figure 1 shows this configuration. However, it must be remembered that both the multiplications and convolutions are complex and a suitable representation of the complex numbers must be used. One representation is by real and imaginary part. Figure 2 shows the DFT organized as a CZT and implemented with parallel computation of the real and imaginary parts. In Figure 2 the input signal is represented as $g = g_R + j g_I$ and the output signal is represented as $G = G_R + j G_I$, where it is understood that $g = g_n$, n = 0, ..., N-1 and $G = G_m$, m = 0, ..., N-1.

In order to determine the specific form of the transversal filters it is necessary to know the specific value of N. When N is odd the Toeplitz matrix T may be represented as a transversal filter with 2N - 1 complex taps $h_{-(N-1)}$ to $h_{N-1}$ where $h_n = W^{-n^2/2}$, n = -(N-1) to (N-1), and $W = \exp(-j2\pi/N)$. The required convolution has been implemented with the general transversal filter shown in Figure 3. When N is even, it can be shown that $t_{nm} = h_{n-m}$, where the subscripts are reduced mod N. Thus T is a circulant matrix and can be implemented with a recirculating transversal filter as shown in Figure 4 where the number of complex taps is N and the tap weights are: $h_n = W^{-n^2/2}$, n = 0, ..., N-1.
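The premultiply, convolve, postmultiply sequence can be checked against a library FFT. The sketch below is a floating-point illustration only, not a model of the analog hardware; it assumes chirps of the form exp(-jπn²/N) for the multipliers and exp(jπl²/N) for the 2N-1 filter taps, and the function name `dft_via_czt` is hypothetical.

```python
import numpy as np

def dft_via_czt(g):
    """DFT computed as chirp premultiply, chirp convolution, chirp postmultiply."""
    g = np.asarray(g, dtype=complex)
    n_pts = len(g)
    n = np.arange(n_pts)
    chirp = np.exp(-1j * np.pi * n**2 / n_pts)            # diagonal factor D
    lags = np.arange(-(n_pts - 1), n_pts)
    taps = np.exp(1j * np.pi * lags**2 / n_pts)           # 2N-1 filter taps
    filtered = np.convolve(g * chirp, taps)               # Toeplitz factor T
    return chirp * filtered[n_pts - 1 : 2 * n_pts - 1]    # second factor D

rng = np.random.default_rng(0)
g = rng.standard_normal(7) + 1j * rng.standard_normal(7)
print(np.max(np.abs(dft_via_czt(g) - np.fft.fft(g))))
```

For even N the tap sequence repeats with period N, which is the property that allows the N-tap recirculating filter; for odd N the taps change sign after N samples, so the full 2N-1 tap filter is needed.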

Charge  Coupled  Devices 


CCDs are sampled data analog circuits which can be fabricated by Metal Oxide Semiconductor (MOS) technology as LSI components. As such they are directly compatible with other MOS circuits. Current CCD transversal filters have operated as video devices with sample rates up to 5 MHz. CCDs operate by the manipulation of injected minority carriers in potential wells under MOS capacitors and thus behave as capacitive reactances with low power dissipation. However, since the potential wells which contain the minority carriers also attract thermally generated minority carriers, there is a maximum storage time for the analog signal which depends on the dark current associated with the temperature of the silicon. Under normal conditions at room temperature, dark currents are tens of nA/cm² and storage times of hundreds of milliseconds can be achieved.

There are many ways in which unidirectional charge transfer can be achieved. The first developed was a three-phase clocking structure which is illustrated in the transversal filter of Figure 5. The three electrode CCD structure is planar, much like the SAW devices, and the direction of charge propagation is determined by the sequence of potentials applied to the three electrodes. Unfortunately, if the minority carriers are allowed to collect at the semiconductor-oxide boundary, poor charge transfer efficiency will result due to minority carriers getting caught in trapping sites. This means that the CCD will behave nonlinearly unless there is sufficient propagating charge present to fill all of the traps. By biasing the operating condition of the CCD so that about 10% of the dynamic range is used for the injection of a "fat zero," the traps are kept continuously filled and the device has over a 60 dB dynamic range. In practice, a video signal representing the signal to be processed is added to a fixed bias somewhat larger than one-half of the peak-to-peak value of the signal. Since the effective storage time of the device is long relative to the time required to execute a convolution, CCDs can be considered to be interruptible signal processors and as such are more compatible with the executive control required for signal processing. A 64 point CCD filter with discrete cosine transform sine and cosine chirps is shown in Figure 6. This chip was developed by Texas Instruments for the Naval Undersea Center for image processing. In addition to the four DCT filters, four DFT filters, a Hilbert transform and other experimental signal processing functions were also implemented.


* Denotes either convolution or circular convolution


Figure 1. Chirp-Z Transform Implementation of the DFT


Figure 2. DFT Via CZT Algorithm with Parallel Implementation
of Complex Arithmetic


Figure 5. Schematic of the Sampling, Weighting, and Summing Operation

Current research in CCDs is directed toward improving the charge transfer efficiency and removing the requirement of continuous "fat zero" charge injection by ion implantation techniques which keep the minority carriers away from the semiconductor oxide boundary. Ion implantation is also being used to provide asymmetric potential wells so that simpler two-phase clocking can be employed. Currently available CCDs have 500 stages with 0.9999 transfer efficiency and devices with up to [digit illegible]000 stages are planned.

Another charge transfer device similar to the CCD is the Bucket Brigade Device (BBD). This is a sequence of MOS transistors coupled together by diffusion enhanced Miller capacitance. Although these devices do not operate at frequencies as high as CCDs, they have better low frequency performance since they include active devices. A CZT has been implemented with two BBD chips. Two 200 tap filters are implemented on each chip: one a discrete cosine and the other a discrete sine filter. The BBD chip is shown in Figure 7. The complex chirp used in the premultiplier and a typical input and output are shown in Figure 8. The input is an offset cosine wave and the output shows a DC component plus a response at the cosine wave frequency. These filters can operate at 100 kHz and have tap accuracies better than 1%. With careful control of geometry, both BBD and CCD filters with tap accuracies approaching 0.1% should be possible. This chip was also developed by Texas Instruments for the Naval Undersea Center.
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Figure  6.  64  Point  CCD  Filters 
Figure 8. BBD Performance


Discrete Cosine Transform


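Let the data sequence be $g_0, g_1, \ldots, g_{N-1}$. The ODCT of g is defined in (5) as

$$G_k = \sum_{n=-(N-1)}^{N-1} g_n\, e^{-j2\pi nk/(2N-1)} \quad \text{for } k = 0, 1, \ldots, N-1$$

The identity (15) may be used to obtain the CZT form of the ODCT shown in equation (16), in which $\hat{g}_n$ equals $0.5\,g_0$ for n = 0 and $g_n$ otherwise, as in equation (8).

$$G_k = 2\,\mathrm{Re}\left\{ e^{-j\pi k^2/(2N-1)} \sum_{n=0}^{N-1} \left[\hat{g}_n\, e^{-j\pi n^2/(2N-1)}\right] e^{j\pi (n-k)^2/(2N-1)} \right\} \qquad (16)$$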
The block diagram of the ODCT is shown in Figure 9. Since only real inputs and outputs are required, a simplified implementation is possible and is shown in Figure 10.

A corresponding implementation may be found for the EDCT. The EDCT of g is defined by equation (9), where the extended sequence is defined by equation (10).
Figure 9. ODCT Block Diagram
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Figure  10.  ODCT  Expanded  Block  Diagram 


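If the mutually complex conjugate terms above are combined and the identity (15) used, the CZT form of the EDCT becomes

$$G_k = 2\,\mathrm{Re}\left\{ e^{-j\pi k/(2N)}\, e^{-j\pi k^2/(2N)} \sum_{n=0}^{N-1} \left[g_n\, e^{-j\pi n^2/(2N)}\right] e^{j\pi (n-k)^2/(2N)} \right\} \qquad (17)$$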

Simultaneous Computation of the DFT and the DCT

The close relationship between the DFT and the DCT permits the use of common modules to simultaneously compute both transforms. This may be accomplished most simply when an EDCT is computed using DFT modules. The sum in the EDCT defining equation (11) may be interpreted as a length 2N DFT of the extension of the function g by N zeros. This leads to the configuration shown in Figure 11. Alternatively, if the odd and even frequencies in the zero-filled DFT are considered separately, they may be computed using length N DFT modules as shown in Figure 12.
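Both arrangements can be sketched with a library FFT in place of the DFT modules. This is an illustration of the arithmetic only and is not taken from the block diagrams, which are not reproduced in this copy; it assumes the EDCT is 2 Re{exp(-iπk/2N) X_k}, where X is the length 2N DFT of g followed by N zeros. The function names are hypothetical.

```python
import numpy as np

def dft_and_edct_single(g):
    """One length-2N DFT of the zero-filled block yields both transforms."""
    g = np.asarray(g, dtype=float)
    n = len(g)
    spectrum = np.fft.fft(g, 2 * n)                 # g followed by N zeros
    k = np.arange(n)
    edct = 2.0 * (np.exp(-1j * np.pi * k / (2 * n)) * spectrum[:n]).real
    dft = spectrum[::2]                             # even bins are the length-N DFT
    return dft, edct

def zero_filled_spectrum_two_modules(g):
    """The same 2N-point spectrum from two length-N DFTs (even and odd bins)."""
    g = np.asarray(g, dtype=float)
    n = len(g)
    even = np.fft.fft(g)
    # Odd bins: pre-rotate the data by half a bin, then take a length-N DFT.
    odd = np.fft.fft(g * np.exp(-1j * np.pi * np.arange(n) / n))
    out = np.empty(2 * n, dtype=complex)
    out[0::2], out[1::2] = even, odd
    return out

g = np.array([3.0, 1.0, 4.0, 1.0, 5.0, 9.0])
dft, edct = dft_and_edct_single(g)
print(edct)
```

The even-numbered bins of the zero-filled spectrum coincide with the ordinary length N DFT, so the DFT comes at no extra cost once the longer transform is available.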

System  Descriptions 


Two hybrid DCT/DPCM bandwidth reduction systems have been selected for construction and evaluation. A block diagram of the systems is shown in Figure 13. The first uses a slow scan image sensor and a Bucket Brigade Device (BBD) transform implementation. The second uses an ordinary vidicon sensor and a Charge Coupled Device (CCD) transform implementation.

In the BBD system a 100 x 100 pixel solid state sensor is used. The nominal horizontal line scan time is one millisecond. The nominal frame rate is 10 frames per second which can be displayed without flicker through the use of a scan converter. The 1 millisecond line scan time was chosen in order to match the sensor to the BBD filter which operates at a clock rate of 100 kHz with good charge transfer efficiency. At 10 frames per second, image motion should be reproduced well enough for many applications, even though some picture detail is lost because of the low spatial sampling afforded by the 100 x 100 pixel format. Minimum overall bit rate is achieved by a combination of zonal filtering and variable bit assignment with low spatial frequencies assigned more bits of quantization than high spatial frequencies.
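The timing figures for the slow scan system follow from simple arithmetic. The worked numbers below are a rough check only: they ignore blanking intervals, synchronization, and any coding overhead, and the channel rates are computed for the 1 and 2 bits/pixel operating points quoted for the experimental system elsewhere in this paper.

```latex
\begin{align*}
T_{\text{line}} &= \frac{100\ \text{pixels}}{100\ \text{kHz}} = 1\ \text{ms} \\
T_{\text{frame}} &= 100\ \text{lines} \times 1\ \text{ms} = 100\ \text{ms}
  \quad\Rightarrow\quad 10\ \text{frames/s} \\
R_{1\ \text{bit/pixel}} &= 100 \times 100 \times 10 \times 1 = 10^{5}\ \text{bits/s} \\
R_{2\ \text{bits/pixel}} &= 100 \times 100 \times 10 \times 2 = 2 \times 10^{5}\ \text{bits/s}
\end{align*}
```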

The second bandwidth reduction system is compatible with a standard vidicon camera. It uses CCD filters for the cosine transform which will operate at 4.8 MHz sampling rate with a block size of 33 pixels. Compatibility with standard television format is maintained in as many aspects as possible. If the interlace field is used directly as the input to the transform hardware, a resolution of approximately 240 lines by 256 pixels is possible at 60 fields/sec. This is equivalent to a video bandwidth slightly less than 2.5 MHz. Computer simulation of this system is shown in Figure 14 for 1 bit/pixel. P is the channel bit error rate.

Figure 11. Computation of the DFT and EDCT Using a Single Length 2N DFT Module

Figure 12. Computation of the DFT and EDCT Using Two Length N DFT Modules

Figure 13. Image Transmission System

The implementation of selected transforms via transversal filters was proposed at the All Applications Digital Computer Conference. The computation of the discrete Fourier transform has been demonstrated using surface wave devices. The discrete cosine transform has been demonstrated using bucket brigade devices.

The DCT implementation block diagram is shown in Figure 10. The convolutions are performed in both television systems by charge transfer devices built by Texas Instruments. The multiplications are performed by conventional circuitry. The reference functions used in the pre- and post-multiply can be stored in a read only memory.

Figure 15 is the implementation block diagram for the DPCM part of the system. The memory is a line store which is used as the predictive element in the DPCM. The cosine transform coefficients stored in the memory are subtracted from the new transform coefficients as they enter the DPCM. The difference is quantized and transmitted with a selectable number of bits per coefficient, the number being a function of the assumed variance of the coefficient. The difference coefficients are then added to the previous line coefficients to create the predictive element for the next line.
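The line-store prediction loop described above can be sketched as follows. This is a simplified illustration and not the hardware design: it assumes a uniform quantizer with no limit on the number of levels, whereas the actual system assigns a selectable number of bits per coefficient. The function names and the `step` parameter are hypothetical; `step` may be a scalar or one value per coefficient.

```python
import numpy as np

def dpcm_encode(coeff_lines, step):
    """Encode successive lines of transform coefficients.
    coeff_lines has shape (lines, coefficients)."""
    memory = np.zeros(coeff_lines.shape[1])       # line store (predictor)
    codes = np.empty(coeff_lines.shape, dtype=int)
    for i, line in enumerate(coeff_lines):
        # Quantize the difference between the new line and the stored line.
        codes[i] = np.round((line - memory) / step).astype(int)
        # Add the quantized difference back to form the next prediction.
        memory = memory + codes[i] * step
    return codes

def dpcm_decode(codes, step):
    """Rebuild the coefficient lines; mirrors the encoder's line store."""
    memory = np.zeros(codes.shape[1])
    out = np.empty(codes.shape, dtype=float)
    for i, code in enumerate(codes):
        memory = memory + code * step
        out[i] = memory
    return out

rng = np.random.default_rng(0)
lines = np.cumsum(rng.standard_normal((8, 4)), axis=0)   # correlated lines
step = np.array([0.25, 0.5, 0.5, 1.0])                   # coarser for high orders
decoded = dpcm_decode(dpcm_encode(lines, step), step)
print(np.max(np.abs(decoded - lines) / step))
```

Because the encoder predicts from its own reconstructed line rather than from the original, the decoder's line store tracks it exactly in the absence of channel errors, and quantization error does not accumulate from line to line.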

The BBD slow scan system is shown in Figure 16. It consists of a discrete cosine transform, a DPCM, a channel simulator in which bit errors can be injected, an inverse DPCM, and an inverse DCT. The system is built on eight wire wrapped boards. A blow-up of one is shown in Figure 17. The two chips in the center are the BBD devices. System performance is illustrated in Figure 18 at 2 bits/pixel. Because the original picture has only 100 x 100 pixels, adjacent pixels are not as well correlated as in the 256 x 256 simulation and bandwidth compression algorithms do not work as well. The system performance at one bit per pixel is shown in Figure 19. Results from the CCD high resolution system are not yet available.


Figure 14. Computer Simulation of CCD System
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Figure  15.  DPCM  Encoder 
Conclusion


The use of an intraframe hybrid transformation composed of a horizontal unitary discrete cosine transform and a vertical first-order DPCM has been shown through simulation to have performance closely approximating that of a two-dimensional Karhunen-Loeve transform. This hybrid transform has been computed in real time with minimum complexity and memory through the use of LSI bucket brigade or charge coupled devices and conventional digital DPCM implementation.
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ABSTRACT 

Transform coding techniques have been explored extensively in theoretical
studies and by simulation. It has been shown that a significant bandwidth reduction
can be achieved in many applications with minimal image degradation and relative
tolerance to channel errors. The major drawback of transform image coding for
real time television applications in the past has been computational complexity.
Transforms and algorithms have been demonstrated with recently developed charge
coupled device and acoustic surface wave technologies which make nearly optimum
image transform coding feasible at real time television rates.


INTRODUCTION 

Unitary transforms for image encoding have been used for intraframe encoding.
In addition, these techniques may also be applied to interframe and multispectral
encoding. However, all unitary transformations are information preserving and no
bandwidth reduction results from the application of the transform to the image.
Instead, the transforms redistribute the variance associated with each picture element
(pixel) so that, subsequent to the transform, basis restricting operations on the
transform coefficients will result in bandwidth reduction. Upon reconstruction of the
original image from the basis restricted transform coefficients, a degraded version of
the original image can be obtained. Unfortunately, the interrelationship between
the type of transform, the form of the noninvertible operation, and the type of
degradation in the reconstructed image is very complicated and subjective. The
universally used analytic criterion of the mean-square-error is, at present, the best
compromise technique for transform comparison.

*To be published in NATO Advanced Study Institute Series, Series E: Applied Sciences, No. 12 (Noordhoff, Leyden, 1975).

For the particular operation of basis restriction by truncation, a particularly
simple interpretation of the bandwidth reduction can be made. The transforms may be
viewed as a variance redistributing operation that approximately decorrelates the
transform coefficients while transforming the variance associated with each picture
element into the low-order coefficients of the transform. Under the assumption that
each set of picture elements can be considered as a sample function from a wide
sense stationary random process with correlation function $r^{|x|}$, there exists an
optimum discrete transformation, the Karhunen-Loeve transformation, which totally
decorrelates the transform coefficients and maximally compacts the variance to the
low-order coefficients. All other transformations can be compared in their
performance by comparing their transform coefficient decorrelation and variance compaction
with this optimum transformation.
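The variance compaction described above can be illustrated numerically. The following is a minimal sketch, not the authors' simulation: it assumes a first-order Markov covariance $r^{|i-j|}$, an orthonormal cosine transform of the even (DCT-II) kind, and illustrative values N = 8 and r = 0.95. The function names are hypothetical.

```python
import math

def dct_matrix(N):
    """Orthonormal even cosine transform (DCT-II) matrix, rows indexed by k."""
    C = []
    for k in range(N):
        scale = math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)
        C.append([scale * math.cos(math.pi * (2 * n + 1) * k / (2 * N))
                  for n in range(N)])
    return C

def coefficient_variances(N, r):
    """Diagonal of C R C^T for the assumed covariance R[i][j] = r**|i-j|."""
    C = dct_matrix(N)
    variances = []
    for k in range(N):
        v = sum(C[k][i] * (r ** abs(i - j)) * C[k][j]
                for i in range(N) for j in range(N))
        variances.append(v)
    return variances

# Illustrative parameters (assumptions, not taken from the report).
variances = coefficient_variances(N=8, r=0.95)
total = sum(variances)                          # a unitary transform preserves total variance
low_order_share = sum(variances[:2]) / total    # share held by the two lowest-order coefficients
```

Because the transform is unitary, `total` equals the sum of the pixel variances; only its distribution among the coefficients changes, which is what makes truncation of the high-order coefficients effective.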

This intuitive interpretation can be made rigorous through the use of the
rate-distortion criterion. It has been found from experience that the closer the
eigenvectors of the transformation are to the eigenvectors of the optimum
Karhunen-Loeve transformation, the greater the variance is compacted and the more the
coefficients can be truncated while maintaining a fixed rate distortion or
mean-square-error.

The use of two-dimensional transforms can provide improved performance over the
use of transformations on a line-by-line basis. The most direct approach is to seek
a two-dimensional transform which simultaneously decorrelates the transform
coefficients and compacts the variance into a corner of the two-dimensional transform
coefficient space. One method is to find a two-dimensional transform which can be
represented as the product of a transform in one direction and a transform in the
other direction. Assuming that a two-dimensional picture can be considered as a
sample function from a random process with two-dimensional correlation $r_1^{|x|} r_2^{|y|}$,
i.e., with a correlation coefficient $r_1$ in direction one and a correlation coefficient $r_2$
in direction two, then the optimum discrete transformation is the successive use of
two Karhunen-Loeve transformations: the first with parameter $r_1$, and the second
with parameter $r_2$.

Also of interest in transform encoding is block size. For a one-dimensional signal
the block size is the number of elements of the transform, and the performance of the
transform improves monotonically with increasing block size. For two-dimensional
images, transform performance also increases with increased number of elements in
each dimension of the transform. However, two-dimensional transforms usually
require intermediate memory to store the transform coefficients in the first direction
while the transform is being computed in the second direction.

Alternatively, two-dimensional transforms can be mixed transforms, i.e.,
different horizontal and vertical transforms. Performance increases with the number of
elements in each direction of the transform. However, for a fixed first transform
size, memory requirements tend to increase linearly with the number of elements
in the second transform direction since all of the coefficients must be stored from
the first transform. The amount of intermediate memory is minimized by the use
of a small block size for the image in the second direction, but performance depends
critically on the choice of the second transform. Thus, the choice of a mixed
transform interacts with the overall system design and the available memory for
coefficient storage.
TRANSFORM ENCODING


Karhunen-Loeve transformation

If a continuous time function of zero mean and exponential autocorrelation function $R(t)$
is considered to be a sample function from a wide-sense stationary random
process, then this time function can be explicitly expanded by the Karhunen-Loeve
expansion and the resulting coefficients will be uncorrelated. For a discrete
function of zero mean and autocorrelation function $R(n) = r^{|n|}$, which may be considered
as a sample function from a first-order Markov process, a similar discrete
Karhunen-Loeve transformation may be defined. This transformation diagonalizes the
covariance matrix and is optimal in the mean-square-error sense for a restricted set of
basis functions that do not span the complete space.

The discrete Karhunen-Loeve expansion is given, for the case N = 2m, as

$$G_k = \sum_{n=1}^{2m} \left[\frac{2}{2m + \lambda_n}\right]^{1/2} \sin\left\{\omega_n\left[k - \frac{2m+1}{2}\right] + \frac{n\pi}{2}\right\} g_n \qquad (1)$$

where

$$\lambda_n = \frac{1 - r^2}{1 - 2r\cos\omega_n + r^2} \qquad (2)$$

and $\omega_n$ are the first N positive roots of

$$\tan 2m\omega = \frac{-(1 - r^2)\sin\omega}{\cos\omega - 2r + r^2\cos\omega} \qquad (3)$$


Since the discrete Karhunen-Loeve expansion involves both the solution of a
transcendental equation and the evaluation of the autocorrelation function of the data to
be transformed, real time computation of this transform is quite complex. However,
Habibi and Wintz have shown that Karhunen-Loeve transformations calculated using
approximate autocorrelation functions are satisfactory for many applications.
Discrete Fourier transform


Since the Fourier transform is asymptotic to the Karhunen-Loeve transformation
for the exponential covariance function and the basis vectors are picture
independent, the Fourier transform represents a logical choice for real time
implementation. The Fourier transform exists for all data lengths N. It is defined by

$$G_k = \sum_{n=0}^{N-1} g_n\, e^{-i 2\pi nk/N}, \qquad k = 0, 1, \ldots, N-1$$


Many methods exist for the computation of discrete Fourier coefficients. The
Goertzel algorithm requires a number of computations proportional to $N^2$ but can
be used for all lengths N. When N is highly composite, "fast" transformations can
be used. Thus, if N is of the form $2^q$, then the number of computations can be
made proportional to Nq. Although "fast" algorithms have been successfully used
on general purpose computers, they are too slow for real time computation since
the algorithm iterates q times before achieving a solution. This problem can be
overcome by the use of q processors in a pipeline architecture, although this increases
the complexity of the processor.
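The $N^2$ cost of the Goertzel approach can be seen in a short sketch: each coefficient takes one pass of a second-order recursion over the N samples, and N such passes give the full transform. The code below is an illustration under the DFT sign convention $e^{-i2\pi nk/N}$ used in this section; the function names and the test sequence are hypothetical.

```python
import cmath
import math

def dft_direct(g):
    """Direct evaluation of G_k = sum_n g_n exp(-i 2 pi n k / N)."""
    N = len(g)
    return [sum(g[n] * cmath.exp(-2j * math.pi * n * k / N) for n in range(N))
            for k in range(N)]

def goertzel(g, k):
    """One DFT coefficient from the Goertzel second-order recursion."""
    N = len(g)
    w = 2.0 * math.pi * k / N
    coeff = 2.0 * math.cos(w)
    s1 = 0.0  # recursion state s[n-1]
    s2 = 0.0  # recursion state s[n-2]
    for x in g:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    # Closing step: G_k = exp(i w) * s[N-1] - s[N-2]
    return cmath.exp(1j * w) * s1 - s2

# Arbitrary test data; the length 7 is deliberately not a power of two.
g = [1.0, 2.0, 0.5, -1.0, 3.0, 0.0, -2.5]
direct = dft_direct(g)
via_goertzel = [goertzel(g, k) for k in range(len(g))]
```

Unlike a radix-2 fast transform, the recursion places no restriction on N, which matches the remark that the Goertzel algorithm can be used for all lengths.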

A linear filter implementation also exists for the discrete Fourier transform which
is both easily implemented and suitable for real-time computation. This algorithm,
called the chirp-Z transform, is based on the substitution $2nk = n^2 + k^2 - (n - k)^2$
and can be used for any length sequence N. With this substitution the DFT becomes

$$G_k = e^{-i\pi k^2/N} \sum_{n=0}^{N-1} e^{i\pi (n-k)^2/N}\, e^{-i\pi n^2/N}\, g_n$$


The transform may be performed as a premultiplication by a discrete chirp,
convolution with a discrete chirp of twice the length, and postmultiplication by a discrete
chirp. This convolution may be computed with either acoustic surface wave filters or
charge transfer devices.
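The three steps (premultiply, convolve, postmultiply) can be checked against the direct DFT in a few lines. This is a software sketch only, with hypothetical function names; in the hardware described here the convolution would be carried out by a transversal filter rather than by the explicit sum shown.

```python
import cmath
import math

def dft_direct(g):
    """Direct evaluation of G_k = sum_n g_n exp(-i 2 pi n k / N)."""
    N = len(g)
    return [sum(g[n] * cmath.exp(-2j * math.pi * n * k / N) for n in range(N))
            for k in range(N)]

def dft_chirp_z(g):
    """DFT computed as premultiply, chirp convolution, postmultiply."""
    N = len(g)

    def chirp(m):
        return cmath.exp(1j * math.pi * m * m / N)

    # Step 1: premultiply the data by the conjugate chirp exp(-i pi n^2 / N).
    pre = [g[n] * chirp(n).conjugate() for n in range(N)]
    out = []
    for k in range(N):
        # Step 2: convolve with the chirp; lags k - n run over -(N-1)..(N-1),
        # i.e. 2N - 1 filter taps ("twice the length").
        acc = sum(pre[n] * chirp(k - n) for n in range(N))
        # Step 3: postmultiply by the conjugate chirp exp(-i pi k^2 / N).
        out.append(chirp(k).conjugate() * acc)
    return out

g = [0.5, -1.0, 2.0, 4.0, 1.5, -3.0]   # arbitrary test block
czt_result = dft_chirp_z(g)
dft_result = dft_direct(g)
```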

Discrete  cosine  transform 

Two different types of discrete cosine transform (DCT) are useful for reduced
redundancy television image transmission. Both are obtained by extending the length
N data block to have even symmetry, taking the discrete Fourier transform (DFT) of
the extended data block, and saving N terms of the resulting DFT. Since the DFT
of a real even sequence is a real even sequence, either DCT is its own inverse if a
normalized DFT is used.

The "Odd DCT" (ODCT) extends the length N data block to length 2N-1, with
the middle point of the extended block as a center of even symmetry. The "Even
DCT" (EDCT) extends the length N block to length 2N, with a center of even
symmetry located between the two points nearest the middle. For example, the odd
length extension of the sequence A B C is C B A B C, and the even length is C B A
A B C. In both cases, the symmetrization eliminates the jumps in the periodic
extension of the data block which would occur if one edge of the data block had a
high value and the other edge had a low value; in effect it performs a sort of
smoothing operation with no loss of information. It will be noted that the terms "odd"
and "even" in ODCT and EDCT refer only to the length of the extended data block;
in both cases the extended data block has even symmetry.
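The two extensions, and the fact that the DFT of the odd-length extension is real, can be sketched as follows. The helper names are hypothetical, and the unnormalized cosine sum used for comparison is an assumption about scaling rather than the report's own definition.

```python
import cmath
import math

def odd_extension(block):
    """Length 2N-1 extension about the first sample: A B C -> C B A B C."""
    return block[:0:-1] + block

def even_extension(block):
    """Length 2N extension about a point between samples: A B C -> C B A A B C."""
    return block[::-1] + block

def odct_via_dft(block):
    """First N DFT terms of the odd extension, rotated so that the centre of
    symmetry sits at index 0 of the periodic sequence."""
    N = len(block)
    M = 2 * N - 1
    periodic = block + block[:0:-1]
    return [sum(periodic[n] * cmath.exp(-2j * math.pi * n * k / M) for n in range(M))
            for k in range(N)]

def odct_cosine_sum(block):
    """Equivalent real form: g_0 + 2 * sum_{n>=1} g_n cos(2 pi n k / (2N-1))."""
    N = len(block)
    M = 2 * N - 1
    return [block[0] + 2.0 * sum(block[n] * math.cos(2 * math.pi * n * k / M)
                                 for n in range(1, N))
            for k in range(N)]

data = [3.0, 1.0, -2.0, 0.5]        # arbitrary test block
from_dft = odct_via_dft(data)
from_cosines = odct_cosine_sum(data)
```

The imaginary parts of `from_dft` vanish to rounding error, which is the property that lets the transform be taken as the real part of a chirp-Z computation.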

Both types of DCT may be implemented using compact, high speed, serial access
hardware, in structures similar to those previously described for the Chirp-Z
transform (CZT) implementation of the DFT.


Let the data sequence be $g_0, g_1, \ldots, g_{N-1}$. The ODCT of g is defined as
By straightforward substitution it may be shown that
\_|  -i2ffnk 
where g is defined by equation DD.
The identity (10) may be used to obtain the CZT form of the ODCT shown in
equation (11).

$$2nk = n^2 + k^2 - (n - k)^2 \qquad (10)$$

The EDCT of g is defined by equation (12), where the extended sequence is
defined by equation (13),
for k = 0, 1, ..., N-1.
If the mutually complex conjugate terms in equation (12) are combined, then
equation (14) results. Equation (14) may be viewed as an alternate way of defining
the EDCT.


Equation (14) may be put in the CZT format given in equation (15).


Gk  =  2  Re 
Ahmed has investigated the use of the EDCT as a substitute for the Karhunen-
Loeve transform and finds that it is superior to the Fourier transform and is
comparable to the Karhunen-Loeve (K-L) in rate-distortion performance while
maintaining the computational simplicity of a transform which does not depend on the picture
statistics. Habibi has shown by simulation that the DCT is equivalent in a
mean-square-error sense to the K-L transform under basis restriction.
Figure 1 shows the performance of the two-dimensional DCT on a
sampled picture with 8 bit quantization. P is the bit error probability of the
transmission channel.


Hybrid  transforms 

Mixed transforms can be used either for intraframe encoding or interframe
encoding depending on the available memory and the type of transforms implemented. In
order for a mixed transform to be competitive with one of the conventional
two-dimensional transforms, it must offer either superior performance or simplicity of
implementation. Of the transforms examined, the odd length cosine transform is
competitive in performance since it can be implemented as the real part of a CZT
and since the transform samples are real and are the samples of an autocorrelation
function which may then be extrapolated by well-known techniques. In simulations
of transform performance, the cosine transform has been shown to closely
approximate the behavior of the Karhunen-Loeve transform.

The benefits of mixed transformation implementation and minimum memory
may be achieved for digital transmission by combining a one- or two-dimensional
unitary transform with generalized DPCM in a hybrid system. The basis of operation of
the hybrid transform is that the unitary transform decorrelates the image within its
constraints of transform type, dimensionality, and block size, while the generalized
DPCM removes the correlation between transform blocks. This hybrid system is
particularly attractive for remote sensor application since it has been found that its
performance is approximately as good as the Karhunen-Loeve transform and its
implementation requires minimum memory.

Figure 2 shows the performance of a hybrid DCT/DPCM simulation. For a bit
error probability of P = 0, there is no significant difference between the
two-dimensional DCT and the DCT/DPCM. When bit errors do exist in the transmission
channel, however, the hybrid system is capable of better performance. This is shown
for a bit error probability P = 10^-2. Figures 3, 4 and 5 show the simulation of the
hybrid DCT/DPCM performance for pictures of other scenes with different statistics.


SYSTEM IMPLEMENTATION

Two hybrid DCT/DPCM bandwidth reduction systems have been selected for
construction and evaluation. A block diagram of the systems is shown in figure 6.
The first uses a Charge Injection Device (CID) image sensor and a Bucket Brigade
Device (BBD) transform implementation. The second uses an ordinary vidicon
sensor and a Charge Coupled Device (CCD) transform implementation.

In the CID system a 100x100 pixel solid state sensor will be used. The nominal
horizontal line scan will be one millisecond. The nominal frame rate will be 10
frames per second which can be displayed without flicker through the use of a scan
converter. The 1 millisecond line scan time was chosen in order to match the sensor
to the BBD filter which operates at a clock rate of 100 kHz with good charge
transfer efficiency. At 10 frames per second, image motion should be reproduced well
enough for many applications, even though some picture detail will be lost because
of the low spatial sampling afforded by the 100x100 pixel format.

[Figure panels: Cos/DPCM, 0.5 bits/pixel, P = 0; Cos/DPCM, 0.5 bits/pixel, P = 10^-2]

Figure 6. Image Transmission System (blocks: Transmitter, Source Encoder, Display)


Minimum overall bit rate will be achieved by a combination of zonal filtering and
variable bit assignment with low spatial frequencies assigned more bits of quantization
than high spatial frequencies. Table 1 shows the bit rate which results from three
overall bits/pixel assignments at a pixel rate of 10^5 pixels/sec. An overall bit
assignment of 1 bit/pixel should result in a signal-to-distortion ratio of approximately 30
dB. In addition, since channel errors occur in the Fourier domain, channel error
rates as large as P = 10^-2 will still provide useful reconstructed images.


The second bandwidth reduction system will be compatible with a standard
vidicon camera. It will use CCD filters for the cosine transform which will operate at a
4.8 MHz sampling rate with a block size of 32 pixels. Compatibility with standard
television format will be maintained in as many aspects as possible. If the interlace
field is used directly as the input to the transform hardware, a resolution of
approximately 480 lines by 256 pixels is possible at 60 fields/sec. This is equivalent to a
video bandwidth slightly less than 2.5 MHz. Table 2 shows the bit rate which
results for the video portion of the field.


Table 1. Bit Rate as a Function of Quantization for the
CID System

Table 2. Bit Rate as a Function of Quantization
for the Vidicon Compatible System

Bits/Pixel     Bit Rate (Megabits/sec)
2              9.6
1              4.8
1/2            2.4
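The bit rates quoted for the two systems follow from multiplying the pixel rate by the average number of bits per pixel. A small sketch of that arithmetic is given below, assuming the pixel rate of the vidicon compatible system equals its 4.8 MHz sampling rate and that the CID system delivers 100x100 pixels at 10 frames/sec; the function name is hypothetical.

```python
def bit_rate(pixel_rate, bits_per_pixel):
    """Channel bit rate in bits/sec for a given pixel rate and average bits/pixel."""
    return pixel_rate * bits_per_pixel

# CID system: 100x100 pixels at 10 frames/sec gives 1e5 pixels/sec.
cid_pixel_rate = 100 * 100 * 10
cid_rate_1bit = bit_rate(cid_pixel_rate, 1)          # bits/sec at 1 bit/pixel

# Vidicon compatible system: pixel rate assumed equal to the 4.8 MHz sampling rate.
ccd_pixel_rate = 4.8e6
ccd_rates_mbps = [bit_rate(ccd_pixel_rate, b) / 1e6 for b in (2, 1, 0.5)]
```

Under these assumptions the three vidicon entries come out as 9.6, 4.8 and 2.4 megabits/sec, in agreement with Table 2.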

The implementation of selected transforms via transversal filters was proposed at
the All Applications Digital Computer Conference. The computation of the
discrete Fourier transform has been demonstrated using surface wave devices. The
discrete cosine transform has been demonstrated using bucket brigade devices.

The DCT implementation block diagram is shown in figure 7. The convolutions
are performed in both television systems by charge transfer devices built by Texas
Instruments. The multiplications are performed by conventional circuitry. Figure 8
is an expanded version of figure 7 showing individual components. The multipliers
and adders can be put on the same chip as the transversal filters so that the complete
cosine transform can be computed using two chips. The reference functions used in
the pre- and post-multiply can be stored in a read only memory.

Figure 9 is the implementation block diagram for the DPCM part of the system.
The memory is a line store which is used as the predictive element in the DPCM.
The cosine transform coefficients stored in the memory are subtracted from the
new transform coefficients as they enter the DPCM. The difference is quantized and
transmitted with a selectable number of bits per coefficient, the number being a
function of the assumed variance of the coefficient. The difference coefficients are then
added to the previous line coefficients to create the predictive element for the next
line.
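The line-store DPCM loop described above can be sketched in software. The following is a simplified illustration, not the hardware design: it assumes a uniform quantizer with one step size per coefficient (standing in for the variance-dependent bit assignment), a line store that starts at zero, and an error-free channel. All names and data are hypothetical.

```python
def dpcm_encode(lines, steps):
    """First-order DPCM between successive lines of transform coefficients.

    lines: list of coefficient rows (one row per image line)
    steps: quantizer step size per coefficient (assumed uniform quantizer)
    """
    prediction = [0.0] * len(steps)      # line store, assumed initially empty
    transmitted, reconstructed = [], []
    for line in lines:
        # Quantize the difference between new coefficients and the stored line.
        diff = [step * round((c - p) / step)
                for c, p, step in zip(line, prediction, steps)]
        # Add the quantized difference back to form the next prediction.
        prediction = [p + d for p, d in zip(prediction, diff)]
        transmitted.append(diff)
        reconstructed.append(prediction)
    return transmitted, reconstructed

def dpcm_decode(transmitted):
    """Receiver: accumulate the quantized differences line by line."""
    prediction = [0.0] * len(transmitted[0])
    decoded = []
    for diff in transmitted:
        prediction = [p + d for p, d in zip(prediction, diff)]
        decoded.append(prediction)
    return decoded

# Illustrative data: three lines of four coefficients, coarser steps at high order.
lines = [[10.2, 3.1, -0.7, 0.2], [10.9, 2.8, -0.4, 0.1], [11.5, 2.2, -0.9, 0.3]]
steps = [0.25, 0.5, 1.0, 1.0]
sent, rebuilt = dpcm_encode(lines, steps)
received = dpcm_decode(sent)
```

Because the prediction is formed from the quantized differences rather than from the original coefficients, the encoder and decoder line stores stay in step and the quantization error does not accumulate from line to line.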


Figure 7. Serial Access Implementation of the DCT. (Index labels: -(N - 1) to (N - 1); 0 < n < (N - 1); 0 < m < (N - 1).)
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CONCLUSION 


The use of an intraframe hybrid transformation composed of a horizontal
unitary discrete cosine transform and a vertical first-order DPCM has been shown
through simulation to have performance closely approximating that of a
two-dimensional Karhunen-Loeve transform. This hybrid transform may be computed
in real time with minimum complexity and memory through the use of LSI bucket
brigade or charge coupled devices and conventional digital DPCM implementation.
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APPENDIX  K 

FREQUENCY SYNTHESIS VIA THE DISCRETE
CHIRP AND PRIME SEQUENCE ROMS


James M. Alsup
Harper J. Whitehouse
Naval Undersea Center
San Diego, CA 92132


ABSTRACT: Surface acoustic wave sampled data filters can be utilized
as serial access read-only-memories to directly implement at carrier
frequencies a coherent fast-frequency-hop synthesizer in the VHF and UHF ranges.
An example of frequency synthesis using surface acoustic wave discrete chirp
filters is shown. A frequency synthesis scheme using a surface acoustic
wave prime-sequence filter is described.


To be published in the Proceedings of the IEEE, Special Issue on
Surface Acoustic Wave Devices, May 1976.


Introduction. 

Many types of frequency synthesizers have been developed over the years [1],
but only recently have read-only-memory (ROM) synthesizers begun to reach maturity.
This has occurred primarily because of the advent of sine-cosine lookup tables
implemented in digital hardware. However, other types of ROM devices are becoming
available, and in particular surface acoustic wave (SAW) devices are starting to
serve this purpose. ROM synthesis is particularly suited for applications which
require coherent fast frequency hopping. SAW ROM technology makes possible the
direct extension of ROM synthesis into the VHF and UHF ranges.

ROM  Synthesis. 

There are several ways to synthesize a sampled sinusoid using discrete ROMs.

One method utilizes one period of a sampled sinusoid stored in sequential order
in a random access ROM [2,3]. If all samples of the ROM are read in sequence with
a sample time interval $\Delta t$, then a sinusoid of period T and frequency $f_0$ is generated,
where $T = N \Delta t = 1/f_0$. However, if every kth sample is read with the same sample
interval $\Delta t$, then the frequency of the generated sinusoid is $k f_0$. Since k can take
on the values 0, 1, ..., N-1, a total of N frequencies which are all harmonically
related can be generated. The sampling ambiguity called aliasing may be evident
depending upon whether complex or real sinusoids are generated.
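The random access ROM method amounts to stepping through a stored period with stride k. A minimal software sketch follows, assuming a complex-valued ROM holding one period of N samples; the function names and the values N = 16, k = 3 are illustrative only.

```python
import cmath
import math

def make_rom(N):
    """One period of a sampled complex sinusoid, stored in sequential order."""
    return [cmath.exp(2j * math.pi * n / N) for n in range(N)]

def read_rom(rom, k, num_samples):
    """Read every kth sample (modulo the ROM length) at a fixed sample interval."""
    N = len(rom)
    return [rom[(k * n) % N] for n in range(num_samples)]

N = 16
rom = make_rom(N)
tone = read_rom(rom, k=3, num_samples=2 * N)   # two ROM periods at frequency 3 * f0
```

Each output sample equals exp(j 2 pi k n / N), so changing k between reads hops the output to another harmonic of the fundamental with a known starting phase.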

A second method for the generation of sampled sinusoids has special application
in SAW technology because it utilizes serial access ROMs. This method consists in
multiplying the outputs of two N-sample discrete chirp ROMs whose serial readouts
are mutually precessed by m samples to synthesize the frequency $m f_0$. For complex
ROMs of the form $\exp(j\pi n^2/N)$, the product takes the form

$$\exp(j\pi n^2/N)\,\exp(-j\pi (n-m)^2/N) = \exp(j 2\pi mn/N)\,\exp(-j\pi m^2/N) \qquad (1)$$

where n corresponds to the time index and m corresponds to the frequency index.
The right-hand side of this equation can be interpreted as a sampled complex
sinusoid of frequency index m multiplied by a complex phase shift which is dependent
only on m. Also note that $\exp(j\pi n^2/N)$ is a periodic function with period $N \Delta t$.
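Equation (1) can be verified numerically. The sketch below multiplies a discrete chirp by the conjugate of a copy delayed by m samples and compares the result with a phase-shifted complex sinusoid; the function names and the values N = 16, m = 5 are illustrative assumptions. The periodicity with period N holds as written for even N, which is why an even N is used here.

```python
import cmath
import math

def chirp(n, N):
    """Discrete chirp sample exp(j pi n^2 / N)."""
    return cmath.exp(1j * math.pi * n * n / N)

def chirp_product(m, N):
    """Left-hand side of equation (1): chirp times conjugate chirp delayed by m."""
    return [chirp(n, N) * chirp(n - m, N).conjugate() for n in range(N)]

def phase_shifted_tone(m, N):
    """Right-hand side of equation (1): tone of index m with phase exp(-j pi m^2 / N)."""
    phase = cmath.exp(-1j * math.pi * m * m / N)
    return [cmath.exp(2j * math.pi * m * n / N) * phase for n in range(N)]

N, m = 16, 5
lhs = chirp_product(m, N)
rhs = phase_shifted_tone(m, N)
max_error = max(abs(a - b) for a, b in zip(lhs, rhs))
```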

For frequency hopping applications, either of these two methods requires a
hopping interval equal to or greater than $T = N \Delta t$ so that at least one period of
each frequency is synthesized with a common nominal bandwidth less than or equal
to 1/T. Thus, the maximum hopping rate for these types of synthesizers is equal
to $f_0$. Synthesis may take place at a carrier frequency $f_c$, in which case $f_0$ refers
to the "fundamental" frequency which would be observed if the comb-like band of
harmonics to be generated at the carrier $f_c$ were shifted to baseband.

Synthesis via read-only-memory is attractive because it makes possible the
coherent generation of a harmonic group of discrete frequencies. These frequencies
all will start with known initial phases, and use of this a priori information can be
made to build coherent receivers capable of recognizing particular phase relationships
among sequentially-produced tone bursts within the operating band of the synthesizer.

ROM  synthesis  is  also  attractive  because  nearly  instantaneous  changes  can  be 
made  when  hopping  from  one  frequency  to  the  next.  In  contrast,  a  phase  lock  loop 
synthesizer  invariably  requires  several  periods  of  the  waveform  being  synthesized 
before  a  stable  lock  is  acquired. 


Discrete  Chirp  SAW  ROMs. 

Discrete chirp transversal filters have been used as elements in CZT
processor systems [4,5]. Such filters may be regarded as acoustic ROMs and used
to implement coherent frequency synthesis. Periodic impulsing of such a SAW
discrete chirp filter will result in periodic generation of the function
$\exp(j\pi n^2/N)$ on a carrier, so that two such ROMs operating at carrier frequencies
$f_1$ and $f_2$ can provide the necessary signals to generate a tone burst over the
duration of the ROM outputs. This tone is obtained by delaying one signal with
respect to the other and multiplying the two ROM outputs:

$$\begin{aligned}
& p(t - n\Delta t)\cos[2\pi f_1 t + \pi n^2/N]\; p(t - n\Delta t)\cos[2\pi f_2 t - \pi (n-m)^2/N] \\
&\quad = p^2(t - n\Delta t)\cos[2\pi (f_1 + f_2) t + 2\pi mn/N - \pi m^2/N] + (f_1 - f_2)\text{-term},
\end{aligned} \qquad (2)$$

where p(t) is the sampling window (e.g., p(t) = 1 for $0 < t < \Delta t$, and p(t) = 0
otherwise). The second SAW chirp ROM output is identical to that of the first one
except that it is delayed by an amount $m \Delta t$, and its modulation is the complex
conjugate of that of the first ROM (equivalent to a negative frequency slope). Since
m can vary over the interval 0, 1, ..., N-1 in an arbitrary order, control over the
hopping sequence is flexible.

Discrete chirp ROMs are also useful since they can be used to generate a given
sinusoid within the band of operation for an arbitrary length of time. This property
is important for applications where frequency hopping is required on a part-time basis
only. Continuous SAW chirp filters can also be used as ROMs in a similar sum or
difference frequency scheme,† but care must be taken to account for possible discontinuities
where end-points of the chirp function are encountered.

Figure 1 illustrates an experimental setup used at NUC to perform frequency
synthesis using SAW discrete chirp ROMs, where $f_1 = f_2$. The base-band output waveform
is shown in Figure 2, and could be filtered additionally to make the sampled
waveform continuous.

† Some work in this area has recently come to our attention [11,12].
Figure 3 shows a proposed configuration for precise automatic control of the hopping
sequence. It makes use of SAW programmable diode multipliers such as have been
described in [6], and incorporates a clocking structure on the same substrate such
as has been described in [7,8].

Prime  Sequence  ROMs. 

A third method for ROM frequency synthesis involves the use of a serial access
memory in conjunction with a permuter so as to achieve the equivalent of a random
access ROM. Figure 4 illustrates such a configuration for the special case when the
ROM is a SAW prime-sequence filter. Such a filter has been designed and constructed
at NUC to implement discrete Fourier transforms via the prime transform algorithm
[9,10]. This filter has as its impulse response the function $\exp(j 2\pi p/N)$, where
$p = R^k \bmod N$, k = 1, 2, ..., N-1, and R is a primitive root of the prime number N.
Thus, this SAW filter can be used as an acoustic ROM to generate permuted values
of a sampled, complex sinusoid on a carrier.

The attraction of this particular ordering of the ROM samples is apparent in the
structure of the auxiliary memory used to operate the tap permuter (Figure 4). This
memory can be implemented in the form of a cyclically operated sequentially-addressed
binary ROM whose starting position relative to the periodic impulsing of the SAW
acoustic ROM determines the output frequency. N-1 frequencies can be generated, but
for hopping rates slower than $\Delta t^{-1}/(N-1)$ a complex sample value (1,0) must be
inserted into the synthesized output every N-1 sample intervals.
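The role of the cyclic offset can be sketched in software. The model below is an assumption about how the permuter addresses the ROM, not a description of the NUC hardware: it stores the samples in primitive-root order, keeps one fixed table of indices, and selects the output frequency purely by adding an offset modulo N-1. The names and the values N = 7, R = 3 are illustrative.

```python
import cmath
import math

def prime_sequence_rom(N, R):
    """ROM contents exp(j 2 pi p / N) with p = R**k mod N, for k = 1..N-1."""
    return [cmath.exp(2j * math.pi * pow(R, k, N) / N) for k in range(1, N)]

def index_table(N, R):
    """Map each n in 1..N-1 to the exponent k with R**k mod N == n."""
    return {pow(R, k, N): k for k in range(1, N)}

def synthesize(N, R, m, num_samples):
    """Samples exp(j 2 pi m n / N) read from the permuted ROM (hypothetical model)."""
    rom = prime_sequence_rom(N, R)
    ind = index_table(N, R)
    offset = ind[m]                      # the frequency is set by a cyclic offset only
    out = []
    for n in range(num_samples):
        if n % N == 0:
            # Phase index 0 is not stored in the ROM; insert the sample (1, 0).
            out.append(complex(1.0, 0.0))
        else:
            k = (ind[n % N] + offset - 1) % (N - 1) + 1   # R**k mod N == m*n mod N
            out.append(rom[k - 1])
    return out

N, R = 7, 3                              # 3 is a primitive root of the prime 7
tone = synthesize(N, R, m=2, num_samples=2 * N)
```

Because multiplication modulo N becomes addition of exponents of R, the same index table serves every one of the N-1 frequencies, which is consistent with the remark that only the starting position of the auxiliary memory needs to change.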

Such a SAW ROM synthesis scheme might be a reasonable alternative to the
chirp method if sample permuters became available whose performance exceeded that
of the chirp multipliers in terms of output accuracy.


Conclusions. 


The feasibility of using surface acoustic wave devices as principal elements
in ROM frequency synthesis has been demonstrated. In particular, the discrete chirp
SAW device is well suited because of its ability to function at the carrier of
interest, its inherent phase predictability, its fast-hop capability, and its cyclic
nature in the generation of long-duration single frequency waveforms. Such SAW
devices also have the characteristic properties of relatively light weight, low
power, and small size, and should be considered for use as direct synthesizers in
the VHF and UHF frequency regions and for those applications where the hopping rate
required may exceed the capabilities of conventional digital techniques.

Acknowledgment. 

This  work  was  supported  in  part  by  Defense  Advanced  Research  Projects  Agency, 
Order  Number  2303,  Code  Number  3610,  monitored  by  Col.  H.M.  Federhen. 




References. 

[1]  Kroupa,  Venceslav  F.,  Frequency  Synthesis.  Halsted  Press/John  Wiley,  New  York, 
1973. 

[2] Tierney, Joseph, Charles M. Rader, and Bernard Gold, "A Digital Frequency
Synthesizer," IEEE Trans. Audio and Electroacoustics, Vol. AU-19, March
1971, pp. 48-57.

[3] Hosking, Rodger H., "Direct Digital Frequency Synthesis," 1973 IEEE Intercon
Technical Papers, Section 34/1, New York, 1973.

[4] Alsup, J.M., R.W. Means, and H.J. Whitehouse, "Real Time Discrete Fourier Trans-
forms Using Surface Acoustic Wave Devices," Proc. IEE International Specialist
Seminar on Component Performance and Systems Applications of Surface Acoustic
Wave Devices, Aviemore, Scotland, September 24-28, 1973.

[5] Alsup, J.M., "Surface Acoustic Wave CZT Processors," Proceedings 1974 Ultra-
sonics Symposium, Milwaukee, Wis., Nov. 1974, pp. 378-381.

[6] Reeder, T.M., "Electronically Variable Chirp Signal Correlation with the Diode
Correlator," IEEE International Microwave Symposium Digest, Atlanta, GA,
June 1974, pp. 237-239.

[7] Lee, L.L., B.J. Hunsinger, and F.Y. Cho, "A SAW-Stabilized Pulse Generator,"
IEEE Trans. Sonics and Ultrasonics, Vol. SU-22, March 1975, pp. 141-142.

[8] Gilden, M., T.M. Reeder, and A.J. DeMaria, "The Mode-Locked SAW Oscillator,"
IEEE 1975 Ultrasonics Symposium Proceedings, Los Angeles, CA, Sept. 1975,
Paper P-3.

[9] Alsup, J.M., "Prime Transform SAW Device," IEEE 1975 Ultrasonics Symposium
Proceedings, Los Angeles, CA, Sept. 1975, Paper G-[illegible].

[10] Rader, C., "Discrete Fourier Transforms When the Number of Data Samples is
Prime," Proceedings IEEE, Vol. 56, 1968, pp. 1107-1108.

[11] Atzeni, C., G. Manes, and L. Masotti, "Signal Processing by Analog Chirp-
Transformation Using SAW Devices," IEEE 1975 Ultrasonics Symposium Pro-
ceedings, Los Angeles, CA, Sept. 1975, Paper G-6.

[12] Grant, P.M., D.P. Morgan, and J.H. Collins, "Generation and Correlation
of Digitally-Controlled Coherent Frequency-Hopped Waveforms Using Surface
Acoustic Wave Devices," submitted to Proceedings IEEE, Letters, Oct. 1975.


[Figure: ROMs built onto same substrate; output taps spaced at one-half
discrete ROM tap interval.]


APPENDIX  L 

EXPONENTIAL  RESIDUE  CODES 




EXPONENTIAL RESIDUE CODES


J. M. Alsup and J. M. Speiser *


Abstract - This note describes a class of pulse compression codes which
were discovered by examining the properties of the prime transform algorithm
[1]. For each prime P, phase hop sequences of length P-1 can be constructed
whose periodic autocorrelation functions have constant sidelobe height of -1
relative to the main peak value of P-1.


For each prime P, there exists at least one integer R such that the residues
modulo P of $R, R^2, \ldots, R^{P-1}$ are all distinct and non-zero, and hence
form a permutation of 1, 2, ..., P-1 [2, 3]. The number of integers possessing
this property for a given prime P is given by the Euler $\phi$-function,
$\phi(P-1)$ [2, 4].
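The existence and counting claims above can be checked by brute force for small primes. The following is a minimal sketch only; the function names (`is_primitive_root`, `primitive_roots`, `euler_phi`) and the test primes 7 and 13 are illustrative choices, not part of the original note.

```python
from math import gcd

def is_primitive_root(r, p):
    """True if the residues of r^1 .. r^(p-1) modulo the prime p are all distinct."""
    residues = {pow(r, n, p) for n in range(1, p)}
    return len(residues) == p - 1

def primitive_roots(p):
    """All integers R in 1..p-1 whose powers permute 1..p-1 (brute-force search)."""
    return [r for r in range(1, p) if is_primitive_root(r, p)]

def euler_phi(a):
    """Euler phi-function by direct count of integers in 1..a coprime to a."""
    return sum(1 for m in range(1, a + 1) if gcd(m, a) == 1)

roots_7 = primitive_roots(7)    # powers of 3 mod 7 are 3, 2, 6, 4, 5, 1
roots_13 = primitive_roots(13)
print(roots_7, roots_13, euler_phi(6), euler_phi(12))
```

For these two primes the number of roots found agrees with $\phi(P-1)$, as the text states.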


The code corresponding to a selected integer ("primitive root") R is the
sequence $W^{R}, W^{R^2}, \ldots, W^{R^{P-1}}$, where $W = \exp(j2\pi/P)$,
$j = \sqrt{-1}$. This sequence is designated an exponential residue code, since
individual terms are formed by exponentiation of the residue of $(j2\pi R^n/P)$
modulo $2\pi j$, which exponent is equivalent to $(j2\pi/P)(R^n \text{ modulo } P)$.
This sequence can also be recognized as a particular scrambled ordering of samples
of cosine and sine functions with sampling interval inversely proportional to P.
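As an illustration of the construction just described, the sketch below builds the code for one small case. The function name `exponential_residue_code` and the values P = 7, R = 3 are assumptions made for the example; any prime and any of its primitive roots could be substituted.

```python
import cmath

def exponential_residue_code(p, r):
    """Return [W^(r^n mod p) for n = 1..p-1], with W = exp(j*2*pi/p)."""
    return [cmath.exp(2j * cmath.pi * pow(r, n, p) / p) for n in range(1, p)]

P, R = 7, 3  # illustrative values; 3 is a primitive root of 7
exponent_sequence = [pow(R, n, P) for n in range(1, P)]
code = exponential_residue_code(P, R)

print(exponent_sequence)  # [3, 2, 6, 4, 5, 1]
for m, c in zip(exponent_sequence, code):
    # each term is the sample cos(2*pi*m/P) + j*sin(2*pi*m/P), in scrambled order
    print(m, round(c.real, 4), round(c.imag, 4))
```

Every term has unit magnitude, so the code is a pure phase-hop sequence, and the last term is W itself because $R^{P-1}$ modulo P equals 1.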

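The corresponding periodic autocorrelation function is

$$\sum_{n=1}^{P-1} W^{R^{n+k}} \left(W^{R^n}\right)^{*} = \sum_{n=1}^{P-1} W^{R^n (R^k - 1)} = \begin{cases} P-1, & k = 0 \\ -1, & k \text{ otherwise.} \end{cases} \tag{1}$$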

To prove the above identity, note that when k is between 1 and P-2, $R^k$ is
not congruent to 1 modulo P, and thus $(R^k - 1)$ is non-zero modulo P. The
exponents in the sum therefore run through the same P-1 values (in different
order) as the sequence $R^n$ itself does. Because the P powers
$W^0, W^1, \ldots, W^{P-1}$ sum to zero, the P-1 terms with non-zero exponent
sum to -1.
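The identity can also be confirmed numerically. The following is a sketch under assumed example values (P = 7, R = 3); the helper names `residue_code` and `periodic_autocorrelation` are hypothetical and introduced only for this check.

```python
import cmath

def residue_code(p, r):
    """Exponential residue code W^(r^n mod p), n = 1..p-1."""
    return [cmath.exp(2j * cmath.pi * pow(r, n, p) / p) for n in range(1, p)]

def periodic_autocorrelation(seq):
    """Cyclic autocorrelation: sum over n of seq[n+k] * conj(seq[n]), for each shift k."""
    length = len(seq)
    return [
        sum(seq[(n + k) % length] * seq[n].conjugate() for n in range(length))
        for k in range(length)
    ]

acf = periodic_autocorrelation(residue_code(7, 3))
for k, value in enumerate(acf):
    print(k, round(value.real, 6), round(value.imag, 6))
```

The zero-shift value is P-1 = 6 and every other shift gives -1, to within floating-point error.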

*To be published, IEEE Transactions on Aerospace and
Electronic Systems, November 1975.

Since $R^{P-1}$ modulo P is congruent to 1 [2], the last term in the sequence of
P-1 elements is always the element W, and is always cyclically followed by the
element $W^{R}$. Therefore, the codes corresponding to different primitive roots are
distinct, and there are $\phi(P-1)$ distinct codes of length P-1.

Careful analysis of the Euler $\phi$-function reveals that, for prime P>3, $\phi(P-1)$
is always even. Furthermore, each primitive root R has an "associate" S belonging
to the set (1, 2, ..., P-1) such that the product R·S modulo P is equal to one,
i.e., $S \equiv R^{-1}$ [2]. Thus,

$$S \equiv R^{P-1} R^{-1} = R^{P-2} \pmod{P} \tag{2}$$

and

$$S^{n} \equiv R^{-n} \equiv R^{P-1-n} \pmod{P} \tag{3}$$


That is, the associate S is also a primitive root which can be used to generate a
sequence $W^{S}, W^{S^2}, \ldots, W^{S^{P-1}}$ which is equivalent to the sequence
$W^{R^{P-2}}, \ldots, W^{R}, W$. Thus, we can state that, for P>3, the $\phi(P-1)$
primitive roots occur in pairs such that one member of a pair generates a sequence
which is the (shifted) reverse of that generated by the other member.
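The pairing of primitive roots can be illustrated on the exponent sequences alone. This is a minimal sketch assuming P = 13 and R = 2 as example values; the helper name `exponents` is hypothetical, and the associate is computed as $R^{P-2}$ modulo P.

```python
def exponents(p, r):
    """Exponent sequence r^1, r^2, ..., r^(p-1) reduced modulo p."""
    return [pow(r, n, p) for n in range(1, p)]

P, R = 13, 2            # illustrative values; 2 is a primitive root of 13
S = pow(R, P - 2, P)    # associate of R, i.e. R*S = 1 modulo P

seq_r = exponents(P, R)
seq_s = exponents(P, S)

# Reverse the R sequence, then rotate by one place so it ends with R^(P-1) = 1.
reversed_r = seq_r[::-1]
shifted_reverse = reversed_r[1:] + reversed_r[:1]

print(S)
print(seq_r)
print(seq_s)
print(seq_s == shifted_reverse)
```

For this case the associate of 2 is 7, and the sequence generated by 7 matches the shifted reverse of the sequence generated by 2.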

An exponential residue code is a shifted version of its own complex conjugate.
This can be seen as follows:

$$\left(W^{R^n}\right)^{*} = W^{-(R^n)} = W^{R^n (P-1)} = W^{R^{n'}} \tag{4}$$

where n' = n + (P-1)/2, and the relationships $W^{P} = 1$; $(P-1)^2$ modulo P = 1; and
$R^{P-1}$ modulo P = 1 have been utilized. The corresponding correlation relation also holds:

$$\sum_{n=1}^{P-1} W^{(R^n)} \, W^{(R^{n+k})} = \begin{cases} P-1, & k = (P-1)/2 \\ -1, & k \text{ otherwise.} \end{cases} \tag{5}$$
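Both the conjugate-shift property and the unconjugated correlation can be checked numerically. The sketch below assumes the example values P = 7 and R = 3; `residue_code` is a hypothetical helper name, and the tolerance 1e-9 is an arbitrary choice.

```python
import cmath

def residue_code(p, r):
    """Exponential residue code W^(r^n mod p), n = 1..p-1."""
    return [cmath.exp(2j * cmath.pi * pow(r, n, p) / p) for n in range(1, p)]

P, R = 7, 3                 # illustrative values
code = residue_code(P, R)
length = P - 1
half = length // 2          # the shift (P-1)/2

# Conjugating each term is the same as cyclically shifting the code by (P-1)/2.
conjugate_is_shift = all(
    abs(code[n].conjugate() - code[(n + half) % length]) < 1e-9
    for n in range(length)
)

# Cyclic correlation without conjugation, for every shift k.
corr = [
    sum(code[n] * code[(n + k) % length] for n in range(length))
    for k in range(length)
]

print(conjugate_is_shift)
print([round(v.real, 6) for v in corr])
```

The peak of P-1 appears at shift (P-1)/2 rather than at zero shift, with -1 at all other shifts.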

The  autocorrelation  function  of  the  exponential  residue  phase  hop  codes  may 
also  be  derived  by  considering  Rader's  prime  transform  algorithm  [1]  when  the  sig¬ 
nal  to  be  analyzed  is  one  of  the  basis  vectors  of  the  discrete  Fourier  transform, 
and  hence  the  output  of  the  prime  transform  is  a  Kronecker  delta.  Other  pulse 
compression  codes  are  known  which  use  the  analog  to  a  logarithm  in  modulo  P 
arithmetic  [5,  6].  Frequency-hop  codes  of  a  type  related  to  exponential  residue 
phase hop codes are also of interest and have been examined by Cooper and Yates [7].


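*From [2], when a number A is written in terms of its prime factors a, b, c, etc:

$$A = a^{\alpha} \, b^{\beta} \, c^{\gamma} \ \text{etc.,} \tag{6}$$

then $\phi(A)$ is given by

$$\phi(A) = A \left(1 - \frac{1}{a}\right)\left(1 - \frac{1}{b}\right)\left(1 - \frac{1}{c}\right) \ \text{etc.} \tag{7}$$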
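As a worked instance of the Euler $\phi$-function in its standard product form, take the prime P = 13 (a value chosen here only for illustration), so that A = P-1 = 12:

```latex
\begin{aligned}
A &= 12 = 2^{2} \cdot 3, \\
\phi(12) &= 12 \left(1 - \tfrac{1}{2}\right)\left(1 - \tfrac{1}{3}\right)
          = 12 \cdot \tfrac{1}{2} \cdot \tfrac{2}{3} = 4 .
\end{aligned}
```

Under this example there are four primitive roots of 13, and hence four distinct exponential residue codes of length 12, forming two associate pairs.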
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